legraphista
commited on
Commit
•
9054864
1
Parent(s):
2e0bb66
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -33,7 +33,7 @@ _Llama.cpp imatrix quantization of google/gemma-2-9b-it_
|
|
33 |
|
34 |
Original Model: [google/gemma-2-9b-it](https://huggingface.co/google/gemma-2-9b-it)
|
35 |
Original dtype: `BF16` (`bfloat16`)
|
36 |
-
Quantized by:
|
37 |
IMatrix dataset: [here](https://gist.githubusercontent.com/bartowski1182/eb213dccb3571f863da82e99418f81e8/raw/b2869d80f5c16fd7082594248e80144677736635/calibration_datav3.txt)
|
38 |
|
39 |
- [Files](#files)
|
|
|
33 |
|
34 |
Original Model: [google/gemma-2-9b-it](https://huggingface.co/google/gemma-2-9b-it)
|
35 |
Original dtype: `BF16` (`bfloat16`)
|
36 |
+
Quantized by: [https://github.com/ggerganov/llama.cpp/pull/8156](https://github.com/ggerganov/llama.cpp/pull/8156)
|
37 |
IMatrix dataset: [here](https://gist.githubusercontent.com/bartowski1182/eb213dccb3571f863da82e99418f81e8/raw/b2869d80f5c16fd7082594248e80144677736635/calibration_datav3.txt)
|
38 |
|
39 |
- [Files](#files)
|