Update README.md
Browse files
README.md
CHANGED
@@ -6,7 +6,9 @@ license_link: https://huggingface.co/Qwen/Qwen-72B-Chat/blob/main/LICENSE
|
|
6 |
|
7 |
This is 2-bit quantization of [Qwen/Qwen-72B-Chat](https://huggingface.co/Qwen/Qwen-72B-Chat) using [QuIP#](https://cornell-relaxml.github.io/quip-sharp/)
|
8 |
|
9 |
-
Random samples from C4 are used as calibration data.
|
|
|
|
|
10 |
|
11 |
## Model loading
|
12 |
Please follow the instruction of [QuIP-for-all](https://github.com/chu-tianxiang/QuIP-for-all) for usage.
|
|
|
6 |
|
7 |
This is 2-bit quantization of [Qwen/Qwen-72B-Chat](https://huggingface.co/Qwen/Qwen-72B-Chat) using [QuIP#](https://cornell-relaxml.github.io/quip-sharp/)
|
8 |
|
9 |
+
Random samples from C4 are used as calibration data.
|
10 |
+
|
11 |
+
For Chinese related tasks, please use the [zh branch](https://huggingface.co/keyfan/Qwen-72B-Chat-2bit/tree/zh) instead, which use bilingual text from C4 and SkyPile as calibration data.
|
12 |
|
13 |
## Model loading
|
14 |
Please follow the instruction of [QuIP-for-all](https://github.com/chu-tianxiang/QuIP-for-all) for usage.
|