Reiterate3680/Aura-NeMo-12B-GGUF

Reiterate3680 committed on Aug 15

Commit c05c20e • 1 Parent(s): e171f84

Create README.md

Files changed (1)

README.md +17 -0

README.md ADDED

	@@ -0,0 +1,17 @@

+---
+base_model: jeiku/Aura-NeMo-12B
+language:
+- en
+license: other
+pipeline_tag: text-generation
+quantized_by: Reiterate3680
+---
+Notes: I merged the adapter back with Unsloth. Looks like you should use the Mistral instruct format.
+L quants (and possibly more), made for fun/testing. You should probably prefer bartowski's or mradermacher's quants if available.
+Original Model: https://huggingface.co/jeiku/Aura-NeMo-12B
+Made with a modified version of https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script
+Q2_K_L, Q4_K_L, Q5_K_L, and Q6_K_L use Q8_0 output tensors and token embeddings. The imatrix was computed with bartowski's imatrix dataset.
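
The `_L` variants described in the README can be sketched as a `llama-quantize` invocation from llama.cpp, built here in Python. This is an illustration under assumptions, not the modified script the author actually used: the mapping from `_L` names to llama.cpp base types follows the common community convention and is assumed, and the file names (`Aura-NeMo-12B-F16.gguf`, `imatrix.dat`) are hypothetical. The sketch only builds the command and does not run it.

```python
import shlex

# ASSUMPTION: "_L" is a community naming convention, not a llama.cpp type.
# Each "_L" quant is taken to be a standard base type whose output tensor
# and token embeddings are overridden to Q8_0.
BASE_TYPE = {
    "Q2_K_L": "Q2_K",
    "Q4_K_L": "Q4_K_M",
    "Q5_K_L": "Q5_K_M",
    "Q6_K_L": "Q6_K",
}


def quantize_command(quant_name, src_gguf, imatrix_path, binary="llama-quantize"):
    """Build the argument list for one "_L" quant (the command is not executed)."""
    base = BASE_TYPE[quant_name]
    stem = src_gguf.rsplit(".", 1)[0]      # drop the ".gguf" extension
    out_gguf = f"{stem}-{quant_name}.gguf"  # hypothetical output naming
    return [
        binary,
        "--imatrix", imatrix_path,           # importance matrix file
        "--output-tensor-type", "q8_0",      # keep the output tensor at Q8_0
        "--token-embedding-type", "q8_0",    # keep token embeddings at Q8_0
        src_gguf,
        out_gguf,
        base,
    ]


if __name__ == "__main__":
    # Hypothetical input files; substitute your own paths.
    for name in BASE_TYPE:
        cmd = quantize_command(name, "Aura-NeMo-12B-F16.gguf", "imatrix.dat")
        print(shlex.join(cmd))
```

Printing the commands rather than running them makes it easy to check the flags against the installed llama.cpp build before committing to a long quantization run.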