Upload README.md
README.md CHANGED
@@ -63,7 +63,7 @@ This is a Mixtral AWQ model.

 For AutoAWQ inference, please install AutoAWQ 0.1.8 or later.

-Support via Transformers is
+Support via Transformers is also available, but currently requires installing Transformers from Github: `pip3 install git+https://github.com/huggingface/transformers.git`

 vLLM: version 0.2.6 is confirmed to support Mixtral AWQs.

@@ -113,7 +113,7 @@ Models are released as sharded safetensors files.

 | Branch | Bits | GS | AWQ Dataset | Seq Len | Size |
 | ------ | ---- | -- | ----------- | ------- | ---- |
-| [main](https://huggingface.co/TheBloke/SauerkrautLM-Mixtral-8x7B-Instruct-AWQ/tree/main) | 4 | 128 | [
+| [main](https://huggingface.co/TheBloke/SauerkrautLM-Mixtral-8x7B-Instruct-AWQ/tree/main) | 4 | 128 | [German Quad](https://huggingface.co/datasets/deepset/germanquad/viewer/) | 8192 | 24.65 GB

 <!-- README_AWQ.md-provided-files end -->

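The first hunk only notes which backends support the quantised weights. As a minimal sketch, not part of the upstream README, of what Transformers-based loading could look like once the Github install above is in place (the prompt and generation settings are illustrative assumptions):

```python
# Sketch: load the AWQ model through Transformers, assuming the Github
# install noted in the diff (`pip3 install git+https://github.com/huggingface/transformers.git`).
# The prompt and generation settings below are illustrative assumptions,
# not values taken from this README.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TheBloke/SauerkrautLM-Mixtral-8x7B-Instruct-AWQ"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",       # spread the sharded safetensors across available GPUs
    low_cpu_mem_usage=True,
)

inputs = tokenizer("Explain AWQ quantization in one sentence.", return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```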
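For the files listed in the table (main branch, 4-bit, group size 128), a hedged sketch of serving them with vLLM, assuming vLLM 0.2.6 or later as stated above; the sampling parameters are illustrative assumptions:

```python
# Sketch: serve the main-branch AWQ weights with vLLM (0.2.6+ per the README).
# Sampling settings are illustrative assumptions.
from vllm import LLM, SamplingParams

llm = LLM(
    model="TheBloke/SauerkrautLM-Mixtral-8x7B-Instruct-AWQ",
    quantization="awq",      # tell vLLM these are AWQ-quantised weights
    dtype="auto",
)

params = SamplingParams(temperature=0.7, max_tokens=128)
outputs = llm.generate(["Briefly explain what a mixture-of-experts model is."], params)
print(outputs[0].outputs[0].text)
```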