CameronRedmore committed on
Commit
f50d898
1 Parent(s): 900a2e2

Update README with quantisation info.

  1. README.md +19 -1
README.md CHANGED
@@ -9,7 +9,25 @@ tags:
  - merge
  license: gemma
  ---
- # Gemma-2-Ataraxy-9B
+ # Gemma-2-Ataraxy-9B-exl2
+
+
+ This repository contains various EXL2 quantisations of [lemon07r/Gemma-2-Ataraxy-9B](https://huggingface.co/lemon07r/Gemma-2-Ataraxy-9B).
+
+ Quantisations available:
+ | Branch | Description | Recommended |
+ | ------ | ------ | ------ |
+ | [2.0-bpw](https://huggingface.co/CameronRedmore/Gemma-2-Ataraxy-9B-exl2/tree/2.0-bpw) | 2 bits per weight | Low Quality - Smallest Available Quantisation |
+ | [3.0-bpw](https://huggingface.co/CameronRedmore/Gemma-2-Ataraxy-9B-exl2/tree/3.0-bpw) | 3 bits per weight | |
+ | [4.0-bpw](https://huggingface.co/CameronRedmore/Gemma-2-Ataraxy-9B-exl2/tree/4.0-bpw) | 4 bits per weight | ✔️ - Recommended for Low-VRAM Environments |
+ | [5.0-bpw](https://huggingface.co/CameronRedmore/Gemma-2-Ataraxy-9B-exl2/tree/5.0-bpw) | 5 bits per weight | |
+ | [6.0-bpw](https://huggingface.co/CameronRedmore/Gemma-2-Ataraxy-9B-exl2/tree/6.0-bpw) | 6 bits per weight | ✔️ - Best Quality / VRAM Balance |
+ | [6.5-bpw](https://huggingface.co/CameronRedmore/Gemma-2-Ataraxy-9B-exl2/tree/6.5-bpw) | 6.5 bits per weight | ✔️ - Near Perfect Quality, Slightly Higher VRAM Usage |
+ | [8.0-bpw](https://huggingface.co/CameronRedmore/Gemma-2-Ataraxy-9B-exl2/tree/8.0-bpw) | 8.0 bits per weight | Best Available Quality - Almost always unnecessary |
+
+ ---
+ # Original README:
+ ---

  ![Ataraxy](https://i.imgur.com/aP03a5d.png)
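
As a rough guide to what the bits-per-weight figures in the table mean for size, the weights-only footprint can be estimated as parameters × bits per weight ÷ 8. The sketch below is not part of the commit. It assumes a round 9 billion parameters (taken from the "9B" in the model name, not an exact count), and the helper name `estimate_weight_size_gb` is hypothetical. Actual branch sizes and VRAM use will differ, since the bits-per-weight value is an average and inference also needs memory for the context cache and activations.

```python
# Rough weights-only size estimate for each quantisation branch.
# ASSUMPTION: 9e9 parameters, a round figure from the "9B" model name.
ASSUMED_PARAMS = 9e9

# Branch names and bits-per-weight values as listed in the README table.
BRANCH_BPW = {
    "2.0-bpw": 2.0,
    "3.0-bpw": 3.0,
    "4.0-bpw": 4.0,
    "5.0-bpw": 5.0,
    "6.0-bpw": 6.0,
    "6.5-bpw": 6.5,
    "8.0-bpw": 8.0,
}


def estimate_weight_size_gb(num_params: float, bits_per_weight: float) -> float:
    """Return the approximate size of the quantised weights in GB (10**9 bytes).

    This counts only the weights: bits -> bytes (divide by 8) -> GB (divide by 1e9).
    """
    return num_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    for branch, bpw in BRANCH_BPW.items():
        size_gb = estimate_weight_size_gb(ASSUMED_PARAMS, bpw)
        print(f"{branch}: ~{size_gb:.1f} GB of weights")
```

Under these assumptions the 4.0-bpw branch comes to roughly half the weight size of the 8.0-bpw branch, which is the trade-off the "Recommended" column is describing.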