DavidAU
/

Command-R-01-Ultra-NEO-DARK-HORROR-V1-V2-35B-IMATRIX-GGUF

Model card Files Files and versions Community

DavidAU commited on Aug 19

Commit

649c245

•

1 Parent(s): 4218fce

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -84,6 +84,7 @@ One version is not stronger than the other, they are different and result in dif
 This chart shows the order in terms of "BPW" for each quant with "IQ1_S" with the least, and "Q8_0" with the most:
 IQ1_S 	| IQ1_M
 IQ2_XXS | IQ2_XS 	| Q2_K_S 	| IQ2_S 	| Q2_K  | IQ2_M
@@ -97,9 +98,11 @@ Q5_K_S	| Q5_K_M
 Q6_K
 Q8_0
-More BPW mean better quality, but higher VRAM requirements (and larger file size) and lower Tokens per second.
 The larger the model in terms of parameters the lower the size of quant you can run with less quality losses.
 (not all quants may be at this repo)

 This chart shows the order in terms of "BPW" for each quant with "IQ1_S" with the least, and "Q8_0" with the most:
+<PRE>
 IQ1_S 	| IQ1_M
 IQ2_XXS | IQ2_XS 	| Q2_K_S 	| IQ2_S 	| Q2_K  | IQ2_M
 Q6_K
 Q8_0
+</pre>
+More BPW mean better quality, but higher VRAM requirements (and larger file size) and lower tokens per second.
 The larger the model in terms of parameters the lower the size of quant you can run with less quality losses.
+Note that "quality losses" refers to both instruction following and output quality.
 (not all quants may be at this repo)