Update README.md
README.md CHANGED
````diff
@@ -28,7 +28,7 @@ Qwen1.5-MoE employs Mixture of Experts (MoE) architecture, where the models are
 We pretrained the models with a large amount of data, and we post-trained the models with both supervised finetuning and direct preference optimization. However, DPO leads to improvements in human preference evaluation but degradation in benchmark evaluation. In the very near future, we will fix both problems.
 
 ## Requirements
-The code of Qwen1.5-MoE has been in the latest Hugging face transformers and we advise you to install
+The code of Qwen1.5-MoE has been in the latest Hugging face transformers and we advise you to build from source with command `pip install git+https://github.com/huggingface/transformers`, or you might encounter the following error:
 ```
 KeyError: 'qwen2_moe'.
 ```
````
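For context, a minimal sketch of what the changed requirement enables, assuming the `Qwen/Qwen1.5-MoE-A2.7B-Chat` hub id: once transformers is installed from source, the `qwen2_moe` model type is recognized and the checkpoint loads instead of raising the `KeyError` shown above.

```python
# Hypothetical check after installing transformers from source
# (pip install git+https://github.com/huggingface/transformers).
# The hub id Qwen/Qwen1.5-MoE-A2.7B-Chat is assumed here for illustration.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen1.5-MoE-A2.7B-Chat"

# With an older release this raised KeyError: 'qwen2_moe'; with a source
# build the config's model type resolves to the Qwen2-MoE model class.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # use the dtype stored in the checkpoint
    device_map="auto",    # requires accelerate; places weights automatically
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
```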
````diff
@@ -71,7 +71,7 @@ generated_ids = [
 response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ```
 
-For quantized models, we advise you to use the GPTQ
+For quantized models, we advise you to use the GPTQ correspondents, namely `Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4`.
 
 
 ## Tips
````
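As a rough usage sketch for the checkpoint named in the new line, generation mirrors the full-precision snippet above; the full hub id `Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4` and the GPTQ runtime dependencies (`optimum`, `auto-gptq`) are assumptions, not part of the diff itself.

```python
# Rough sketch of running the GPTQ-quantized chat model; the hub id and
# the optimum/auto-gptq dependencies are assumed, not stated in the diff.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4"
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Same chat-template flow as the README's full-precision example.
messages = [{"role": "user", "content": "Give me a short introduction to large language models."}]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(**inputs, max_new_tokens=128)
# Drop the prompt tokens before decoding, keeping only the newly generated ones.
generated_ids = generated_ids[:, inputs.input_ids.shape[1]:]
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)
```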