inflatebot
commited on
Commit
•
b015a1a
1
Parent(s):
02ca75d
Update README.md
Browse files
README.md
CHANGED
@@ -19,8 +19,8 @@ A roleplay-centric finetune of IBM's Granite 3.0 3B-A800M. LoRA finetune trained
|
|
19 |
PLEASE do not expect godliness out of this, it's a model with _800 million_ active parameters. Expect something more akin to GPT-3 (the original, not GPT-3.5.)
|
20 |
(Furthermore, this version is by a less experienced tuner; it's my first finetune that actually has decent-looking graphs, I don't really know what I'm doing yet!)
|
21 |
## Quants
|
22 |
-
|
23 |
-
|
24 |
## Prompting
|
25 |
Use ChatML.
|
26 |
```
|
|
|
19 |
PLEASE do not expect godliness out of this, it's a model with _800 million_ active parameters. Expect something more akin to GPT-3 (the original, not GPT-3.5.)
|
20 |
(Furthermore, this version is by a less experienced tuner; it's my first finetune that actually has decent-looking graphs, I don't really know what I'm doing yet!)
|
21 |
## Quants
|
22 |
+
[GGUFs available from mradermacher](https://huggingface.co/mradermacher/MoE-Girl-800MA-3BT-GGUF/tree/main) (thanks man)
|
23 |
+
Note that Granite quants have been said to be unstable. Try running the FP16 if it outputs straight gibberish.
|
24 |
## Prompting
|
25 |
Use ChatML.
|
26 |
```
|