Text Generation
GGUF
English
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
story
writing
fiction
roleplaying
horror
general usage
roleplay
neo quant
fantasy
story telling
ultra high precision
Inference Endpoints
imatrix
conversational
Update README.md
Browse files
README.md
CHANGED
@@ -82,7 +82,8 @@ One version is not stronger than the other, they are different and result in dif
|
|
82 |
|
83 |
<B>Recommended Quants:</B>
|
84 |
|
85 |
-
This chart shows the order in terms of "BPW" for each quant
|
|
|
86 |
|
87 |
<small>
|
88 |
<PRE>
|
@@ -100,6 +101,8 @@ More BPW mean better quality, but higher VRAM requirements (and larger file size
|
|
100 |
The larger the model in terms of parameters the lower the size of quant you can run with less quality losses.
|
101 |
Note that "quality losses" refers to both instruction following and output quality.
|
102 |
|
|
|
|
|
103 |
(not all quants may be at this repo)
|
104 |
|
105 |
Suggestions for this model:
|
|
|
82 |
|
83 |
<B>Recommended Quants:</B>
|
84 |
|
85 |
+
This chart shows the order in terms of "BPW" for each quant (mapped below with relative "strength" to one another)
|
86 |
+
with "IQ1_S" with the least, and "Q8_0" with the most:
|
87 |
|
88 |
<small>
|
89 |
<PRE>
|
|
|
101 |
The larger the model in terms of parameters the lower the size of quant you can run with less quality losses.
|
102 |
Note that "quality losses" refers to both instruction following and output quality.
|
103 |
|
104 |
+
Differences (quality) between quants at lower levels are larger relative to higher quants differences.
|
105 |
+
|
106 |
(not all quants may be at this repo)
|
107 |
|
108 |
Suggestions for this model:
|