JustinLin610
commited on
Commit
•
c0951ae
1
Parent(s):
ff2fb40
Update README.md
Browse files
README.md
CHANGED
@@ -10,7 +10,7 @@ tags:
|
|
10 |
- chat
|
11 |
---
|
12 |
|
13 |
-
# Qwen2-beta-Chat
|
14 |
|
15 |
|
16 |
## Introduction
|
@@ -67,7 +67,7 @@ generated_ids = [output_ids[len(input_ids):] for input_ids, output_ids in zip(mo
|
|
67 |
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
68 |
```
|
69 |
|
70 |
-
For quantized models, we advise you to use the GPTQ, AWQ, and GGUF correspondents, namely `Qwen-beta-
|
71 |
|
72 |
|
73 |
## Limitations
|
|
|
10 |
- chat
|
11 |
---
|
12 |
|
13 |
+
# Qwen2-beta-1.8B-Chat
|
14 |
|
15 |
|
16 |
## Introduction
|
|
|
67 |
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
68 |
```
|
69 |
|
70 |
+
For quantized models, we advise you to use the GPTQ, AWQ, and GGUF correspondents, namely `Qwen-beta-1_8B-Chat-GPTQ`, `Qwen-beta-1_8B-Chat-AWQ`, and `Qwen-beta-1_8B-Chat-GGUF`.
|
71 |
|
72 |
|
73 |
## Limitations
|