baebee commited on
Commit
7184fa4
1 Parent(s): 8e08011

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -0
README.md CHANGED
@@ -5,3 +5,28 @@ datasets:
5
  - ehartford/dolphin
6
  ---
7
  This model is a finetuned version of the DeciLM-6b-instruct on the Dolphin GPT4 Dataset
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  - ehartford/dolphin
6
  ---
7
  This model is a finetuned version of the DeciLM-6b-instruct on the Dolphin GPT4 Dataset
8
+
9
+ Please set naive_attention_prefil to true when loading this model.
10
+
11
+ **Example:**
12
+ ```
13
+ import torch
14
+ from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig, AutoTokenizer
15
+
16
+ model_name = "NewstaR/Porpoise-6b-instruct"
17
+
18
+ bnb_config = BitsAndBytesConfig(
19
+ load_in_4bit=True,
20
+ bnb_4bit_quant_type="nf4",
21
+ bnb_4bit_compute_dtype=torch.float16,
22
+ )
23
+
24
+ model = AutoModelForCausalLM.from_pretrained(
25
+ model_name,
26
+ quantization_config=bnb_config,
27
+ trust_remote_code=True,
28
+ naive_attention_prefill=True,
29
+ )
30
+ model.config.use_cache = False
31
+ ```
32
+