base_model: google/gemma-2-9b-it
- f32 GGUF is from the official Kaggle repo
- imatrix quants are running and will be uploaded one-by-one
- you will need the gemma2 llama.cpp [PR](https://github.com/ggerganov/llama.cpp/pull/8156) applied to your llama.cpp
- current quants are based on the f32 GGUF provided by Google directly; I will reconvert from transformers once the dust settles to get better GGUF metadata

# Original Model Card