inflatebot commited on
Commit
b015a1a
1 Parent(s): 02ca75d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -19,8 +19,8 @@ A roleplay-centric finetune of IBM's Granite 3.0 3B-A800M. LoRA finetune trained
19
  PLEASE do not expect godliness out of this, it's a model with _800 million_ active parameters. Expect something more akin to GPT-3 (the original, not GPT-3.5.)
20
  (Furthermore, this version is by a less experienced tuner; it's my first finetune that actually has decent-looking graphs, I don't really know what I'm doing yet!)
21
  ## Quants
22
- Soon:tm:
23
-
24
  ## Prompting
25
  Use ChatML.
26
  ```
 
19
  PLEASE do not expect godliness out of this, it's a model with _800 million_ active parameters. Expect something more akin to GPT-3 (the original, not GPT-3.5.)
20
  (Furthermore, this version is by a less experienced tuner; it's my first finetune that actually has decent-looking graphs, I don't really know what I'm doing yet!)
21
  ## Quants
22
+ [GGUFs available from mradermacher](https://huggingface.co/mradermacher/MoE-Girl-800MA-3BT-GGUF/tree/main) (thanks man)
23
+ Note that Granite quants have been said to be unstable. Try running the FP16 if it outputs straight gibberish.
24
  ## Prompting
25
  Use ChatML.
26
  ```