Update README.md
--- a/README.md
+++ b/README.md
@@ -31,7 +31,7 @@ license: cc-by-nc-4.0
 - quants done with an importance matrix for improved quantization loss
 - 0, K & IQ quants in basically all variants from Q8 down to IQ1_S
 - Quantized with [llama.cpp](https://github.com/ggerganov/llama.cpp) commit [04976db7a819fcf8bfefbfc09a3344210b79dd27](https://github.com/ggerganov/llama.cpp/commit/04976db7a819fcf8bfefbfc09a3344210b79dd27) (master from 2024-05-07)
--
+- Imatrix generated with [this](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384) dataset.
 ```
 ./imatrix -c 512 -m $model_name-f16.gguf -f $llama_cpp_path/groups_merged.txt -o $out_path/imat-f16-gmerged.dat
 ```