File size: 573 Bytes
7449d16 8943cdc ae2e095 fb2d556 ae2e095 b1c942e fb2d556 860a91e fb2d556 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 |
---
license: gemma
language:
- en
pipeline_tag: text-generation
tags:
- google
- gemma
- gguf
- imatrix
base_model: google/gemma-2-9b-it
---
# Quant Infos
- f32 gguf is from the official kaggle repo
- imatrix quants are running and will be uploaded one-by-one
- you will need the gemma2 llama.cpp [PR](https://github.com/ggerganov/llama.cpp/pull/8156) applied to your llama.cpp
- current quants are based on the f32 gguf provided by google directly, I will reconvert from the huggingface repo once the dust settles to get better gguf metadata
# Original Model Card
TODO |