Update README.md
Browse files
README.md
CHANGED
@@ -3,6 +3,9 @@ tags:
|
|
3 |
- fp8
|
4 |
---
|
5 |
|
|
|
|
|
|
|
6 |
```
|
7 |
vllm (pretrained=nm-testing/Mixtral-8x7B-Instruct-v0.1-FP8), gen_kwargs: (None), limit: None, num_fewshot: 5, batch_size: 1
|
8 |
| Groups |Version|Filter|n-shot|Metric|Value | |Stderr|
|
|
|
3 |
- fp8
|
4 |
---
|
5 |
|
6 |
+
Mixtral-8x7B-Instruct-v0.1 quantized to FP8 weights and activations, meant to be deployed in vLLM.
|
7 |
+
|
8 |
+
Accuracy on MMLU:
|
9 |
```
|
10 |
vllm (pretrained=nm-testing/Mixtral-8x7B-Instruct-v0.1-FP8), gen_kwargs: (None), limit: None, num_fewshot: 5, batch_size: 1
|
11 |
| Groups |Version|Filter|n-shot|Metric|Value | |Stderr|
|