File size: 375 Bytes
7449d16
 
 
 
 
 
 
 
 
 
ae2e095
 
fb2d556
ae2e095
b1c942e
 
fb2d556
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
license: gemma
language:
- en
pipeline_tag: text-generation
tags:
- google
- gemma
- gguf
- imatrix
---

# Quant Infos

- f32 gguf is from the official kaggle repo
- imatrix quants are running and will be uploaded one-by-one
- you will need the gemma2 llama.cpp [PR](https://github.com/ggerganov/llama.cpp/pull/8156) applied to your llama.cpp

# Original Model Card

TODO