Update README.md
Browse files
README.md
CHANGED
@@ -7,6 +7,10 @@ iMatrix gguf quants of a newer finetune of Mixtral-8x22B
|
|
7 |
|
8 |
EdgeQuants still underway, IQ4XS version recommended. Make sure to combine/merge the parts back together before using
|
9 |
|
|
|
10 |
```
|
11 |
cat tessIQ4XS.gguf.part* > tessIQ4XS.gguf
|
12 |
-
```
|
|
|
|
|
|
|
|
7 |
|
8 |
EdgeQuants still underway, IQ4XS version recommended. Make sure to combine/merge the parts back together before using
|
9 |
|
10 |
+
|
11 |
```
|
12 |
cat tessIQ4XS.gguf.part* > tessIQ4XS.gguf
|
13 |
+
```
|
14 |
+
|
15 |
+
|
16 |
+
Then use with llama.cpp version from April 12 or older. April 13 release had massive changes and messed up inferene for MoE models
|