HiroseKoichi
/

Llama-Salad-4x8B-V2

Text Generation

nsfw

Not-For-All-Audiences

text-generation-inference

Mixture of Experts

Inference Endpoints

Model card Files Files and versions Community

HiroseKoichi commited on May 31

Commit

81d3858

•

1 Parent(s): c79eb93

Update README.md

Files changed (1) hide show

README.md +6 -0

README.md CHANGED Viewed

@@ -18,6 +18,12 @@ V2 has improvements in all areas from V1; it's not a massive improvement, but I
 I really like the model selection in this one, so I don't know how much more I can improve if I make another 4x8B merge. If I were to make a V3, swapping Meta-Llama-3-8B-Instruct would likely be the only change. I will try my hand at making an 8x8B merge in the future, but I still need to find some models to fill the gaps; making sure there's no routing conflicts between 8 different models at once will be the biggest challenge.
 # Details
 - **License**: [llama3](https://llama.meta.com/llama3/license/)

 I really like the model selection in this one, so I don't know how much more I can improve if I make another 4x8B merge. If I were to make a V3, swapping Meta-Llama-3-8B-Instruct would likely be the only change. I will try my hand at making an 8x8B merge in the future, but I still need to find some models to fill the gaps; making sure there's no routing conflicts between 8 different models at once will be the biggest challenge.
+# Quantization Formats
+**GGUF**
+- Static:
+    - https://huggingface.co/mradermacher/Llama-Salad-4x8B-V2-GGUF
+- Imatrix:
+    - https://huggingface.co/mradermacher/Llama-Salad-4x8B-V2-i1-GGUF
 # Details
 - **License**: [llama3](https://llama.meta.com/llama3/license/)