allura-org
/

MoE-Girl-800MA-3BT

Text Generation

Mixture of Experts

Inference Endpoints

Model card Files Files and versions Community

inflatebot commited on 29 days ago

Commit

b015a1a

•

1 Parent(s): 02ca75d

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -19,8 +19,8 @@ A roleplay-centric finetune of IBM's Granite 3.0 3B-A800M. LoRA finetune trained
 PLEASE do not expect godliness out of this, it's a model with _800 million_ active parameters. Expect something more akin to GPT-3 (the original, not GPT-3.5.)
 (Furthermore, this version is by a less experienced tuner; it's my first finetune that actually has decent-looking graphs, I don't really know what I'm doing yet!)
 ## Quants
-Soon:tm:
 ## Prompting
 Use ChatML.
 ```

 PLEASE do not expect godliness out of this, it's a model with _800 million_ active parameters. Expect something more akin to GPT-3 (the original, not GPT-3.5.)
 (Furthermore, this version is by a less experienced tuner; it's my first finetune that actually has decent-looking graphs, I don't really know what I'm doing yet!)
 ## Quants
+[GGUFs available from mradermacher](https://huggingface.co/mradermacher/MoE-Girl-800MA-3BT-GGUF/tree/main) (thanks man)
+Note that Granite quants have been said to be unstable. Try running the FP16 if it outputs straight gibberish.
 ## Prompting
 Use ChatML.
 ```