Joseph717171
commited on
Commit
•
76e21df
1
Parent(s):
da6ec4c
Update README.md
Browse files
README.md
CHANGED
@@ -6,9 +6,11 @@ author: froggeric (https://huggingface.co/datasets/froggeric/imatrix/edit/main/R
|
|
6 |
# All credit for this wonderful Repo Card detailing and explaining the similarities and differences of computed imatrices and detailing and explaining the differences, similarities, and, highlighted significances of training datasets and their purported purposes for particular large language models, goes to froggeric.
|
7 |
# Input files for generating the Importance Matrix
|
8 |
|
9 |
-
# Note: All uploaded imatrices to
|
10 |
-
|
11 |
|
|
|
|
|
|
|
12 |
|
13 |
## Which file to use for generating the importance matrix
|
14 |
|
|
|
6 |
# All credit for this wonderful Repo Card detailing and explaining the similarities and differences of computed imatrices and detailing and explaining the differences, similarities, and, highlighted significances of training datasets and their purported purposes for particular large language models, goes to froggeric.
|
7 |
# Input files for generating the Importance Matrix
|
8 |
|
9 |
+
# Note: All uploaded imatrices to this repo are pre-computed, and are ready to be used in llama.cpp's quantization process.
|
|
|
10 |
|
11 |
+
```
|
12 |
+
llama.cpp % ./quantize --imatrix path_to_imatrix path_to_model_files model_name-QuantType.gguf QuantType
|
13 |
+
```
|
14 |
|
15 |
## Which file to use for generating the importance matrix
|
16 |
|