Commit c05c20e by Reiterate3680
1 Parent(s): e171f84
Create README.md
README.md
ADDED
@@ -0,0 +1,17 @@
+---
+base_model: jeiku/Aura-NeMo-12B
+language:
+- en
+license: other
+pipeline_tag: text-generation
+quantized_by: Reiterate3680
+---
+Notes: I merged the adapter back with Unsloth. The Mistral instruct format appears to be the right prompt format to use.
+
+L quants (or more), made for fun and testing. Prefer bartowski's or mradermacher's quants if they are available.
+
+Original Model: https://huggingface.co/jeiku/Aura-NeMo-12B
+
+Made with a modified version of https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script
+
+Q2_K_L, Q4_K_L, Q5_K_L, and Q6_K_L use Q8_0 output tensors and token embeddings. The imatrix was computed with bartowski's imatrix dataset.
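As a rough illustration of what the "_L" variants described in the README amount to, the sketch below assembles a llama.cpp `llama-quantize` command line that keeps the output tensor and token embeddings at Q8_0 and applies an imatrix. It is not the script the commit author used. The mapping from each "_L" label to a stock quant type, the helper name `build_quantize_command`, and every file name are assumptions made for illustration. The command is only built, not executed.

```python
from typing import List, Optional

# Assumption: each "_L" label is a stock llama.cpp quant type with the
# output tensor and token embeddings overridden to Q8_0.
BASE_TYPES = {
    "Q2_K_L": "Q2_K",
    "Q4_K_L": "Q4_K_M",
    "Q5_K_L": "Q5_K_M",
    "Q6_K_L": "Q6_K",
}


def build_quantize_command(
    src: str,
    dst: str,
    label: str,
    imatrix: Optional[str] = None,
    binary: str = "llama-quantize",
) -> List[str]:
    """Return the argument list for one hypothetical "_L" quantization run."""
    if label not in BASE_TYPES:
        raise ValueError(f"unknown quant label: {label}")
    cmd = [binary]
    if imatrix is not None:
        # Importance matrix file computed beforehand from a calibration dataset.
        cmd += ["--imatrix", imatrix]
    cmd += [
        "--output-tensor-type", "q8_0",    # keep output.weight at Q8_0
        "--token-embedding-type", "q8_0",  # keep token embeddings at Q8_0
        src,
        dst,
        BASE_TYPES[label],
    ]
    return cmd


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    print(" ".join(build_quantize_command(
        "Aura-NeMo-12B-F16.gguf",
        "Aura-NeMo-12B-Q4_K_L.gguf",
        "Q4_K_L",
        imatrix="imatrix.dat",
    )))
```

Passing the resulting list to `subprocess.run` would perform the quantization, assuming a llama.cpp build with `llama-quantize` on the `PATH`.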