NewstaR/Porpoise-6b-instruct


baebee committed on Sep 17, 2023

Commit 7184fa4 (1 parent: 8e08011)

Update README.md

Files changed (1)

README.md +25 -0

@@ -5,3 +5,28 @@ datasets:
 - ehartford/dolphin
 ---
 This model is a finetuned version of the DeciLM-6b-instruct on the Dolphin GPT4 Dataset
+Please set `naive_attention_prefill` to `True` when loading this model.
+**Example:**
+```
+import torch
+from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
+model_name = "NewstaR/Porpoise-6b-instruct"
+bnb_config = BitsAndBytesConfig(
+    load_in_4bit=True,
+    bnb_4bit_quant_type="nf4",
+    bnb_4bit_compute_dtype=torch.float16,
+)
+model = AutoModelForCausalLM.from_pretrained(
+    model_name,
+    quantization_config=bnb_config,
+    trust_remote_code=True,
+    naive_attention_prefill=True,
+)
+model.config.use_cache = False
+```
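
The README example stops after loading the model and imports `AutoTokenizer` without using it. As a minimal sketch of a possible next step (not part of this commit), the helper below runs one generation and returns only the newly generated text. The names `generate_reply` and `new_tokens_only` are hypothetical, and the sketch assumes the repository ships tokenizer files that `AutoTokenizer.from_pretrained` can load.

```python
def new_tokens_only(output_ids, prompt_length):
    """Drop the echoed prompt tokens from one generated sequence."""
    return output_ids[prompt_length:]


def generate_reply(model, tokenizer, prompt, max_new_tokens=64):
    """Tokenize `prompt`, call `model.generate`, and decode the continuation.

    `model` and `tokenizer` are expected to be Hugging Face transformers
    objects, e.g. the `model` built in the README example.
    """
    # Move the encoded prompt to the same device as the model weights.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # generate() returns prompt + continuation; keep only the continuation.
    prompt_length = inputs["input_ids"].shape[1]
    continuation = new_tokens_only(output_ids[0], prompt_length)
    return tokenizer.decode(continuation, skip_special_tokens=True)


# Usage (assumes `model` and `model_name` from the README example):
#   tokenizer = AutoTokenizer.from_pretrained(model_name)
#   print(generate_reply(model, tokenizer, "What is a porpoise?"))
```

The prompt text here is only a placeholder; this commit does not document a prompt template for the model.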