Update README.md
README.md
CHANGED
@@ -28,13 +28,14 @@ It is the result of quantising to 4bit using [AutoGPTQ](https://github.com/PanQiWei/AutoGPTQ)
 * [4-bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/CAMEL-33B-Combined-Data-GPTQ)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/baichuan-inc/baichuan-7B)
 
-## Experimental first GPTQ, requires latest
+## Experimental first GPTQ, requires latest AutoGPTQ code
 
 This is a first quantisation of a brand new model type.
 
 It will only work with AutoGPTQ, and only with the latest version of AutoGPTQ, compiled from source.
 
 To merge this PR, please follow these steps to install the latest AutoGPTQ from source:
+
 **Linux**
 ```
 pip uninstall -y auto-gptq
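The hunk ends mid-code-block, right after the `pip uninstall` line, so the diff does not show the rest of the from-source install. As a minimal sketch of how such an install typically continues, assuming the standard layout of the repository linked in the hunk header (the clone and build steps below are an assumption, not part of this diff):

```
# Assumed continuation of the README's Linux steps (not shown in the truncated hunk):
# fetch the AutoGPTQ source and build it locally.
git clone https://github.com/PanQiWei/AutoGPTQ   # repo URL taken from the hunk header link
cd AutoGPTQ
pip install .   # builds the extension from source; requires a working compiler/CUDA toolchain
```

Building from source this way replaces the released `auto-gptq` wheel removed by the `pip uninstall` step, which is what the README's "compiled from source" requirement refers to.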