ThomasBaruzier
commited on
Commit
•
4891dca
1
Parent(s):
ff1aa1b
Update README.md
Browse files
README.md
CHANGED
@@ -25,6 +25,37 @@ All quants were made using the imatrix option and Bartowski's [calibration file]
|
|
25 |
|
26 |
# Perplexity table (the lower the better)
|
27 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
28 |
<hr>
|
29 |
|
30 |
# Qwen2.5-14B-Instruct
|
|
|
25 |
|
26 |
# Perplexity table (the lower the better)
|
27 |
|
28 |
+
| Quant | Size (MB) | PPL | Size (%) | Accuracy (%) | PPL error rate |
|
29 |
+
| ------- | --------- | ------- | -------- | ------------ | -------------- |
|
30 |
+
| IQ1_S | 3441 | 22.0082 | 12.21 | 27.14 | 0.16818 |
|
31 |
+
| IQ1_M | 3693 | 15.079 | 13.11 | 39.62 | 0.1106 |
|
32 |
+
| IQ2_XXS | 4114 | 9.6047 | 14.6 | 62.2 | 0.06625 |
|
33 |
+
| IQ2_XS | 4487 | 8.3649 | 15.92 | 71.41 | 0.05574 |
|
34 |
+
| IQ2_S | 4772 | 8.1942 | 16.93 | 72.9 | 0.0548 |
|
35 |
+
| IQ2_M | 5109 | 7.7261 | 18.13 | 77.32 | 0.05177 |
|
36 |
+
| Q2_K_S | 5148 | 8.0641 | 18.27 | 74.08 | 0.0549 |
|
37 |
+
| Q2_K | 5504 | 7.6005 | 19.53 | 78.6 | 0.05146 |
|
38 |
+
| IQ3_XXS | 5672 | 6.9285 | 20.13 | 86.22 | 0.04547 |
|
39 |
+
| IQ3_XS | 6088 | 6.721 | 21.6 | 88.88 | 0.04329 |
|
40 |
+
| Q3_K_S | 6352 | 6.8697 | 22.54 | 86.96 | 0.04576 |
|
41 |
+
| IQ3_S | 6383 | 6.6246 | 22.65 | 90.17 | 0.04285 |
|
42 |
+
| IQ3_M | 6597 | 6.6359 | 23.41 | 90.02 | 0.04256 |
|
43 |
+
| Q3_K_M | 7000 | 6.5281 | 24.84 | 91.51 | 0.043 |
|
44 |
+
| Q3_K_L | 7558 | 6.4323 | 26.82 | 92.87 | 0.04211 |
|
45 |
+
| IQ4_XS | 7744 | 6.2005 | 27.48 | 96.34 | 0.04022 |
|
46 |
+
| Q4_0 | 8149 | 6.2928 | 28.92 | 94.93 | 0.04095 |
|
47 |
+
| IQ4_NL | 8154 | 6.208 | 28.94 | 96.23 | 0.04032 |
|
48 |
+
| Q4_K_S | 8177 | 6.163 | 29.02 | 96.93 | 0.03976 |
|
49 |
+
| Q4_K_M | 8572 | 6.1311 | 30.42 | 97.43 | 0.03957 |
|
50 |
+
| Q4_1 | 8958 | 6.1674 | 31.79 | 96.86 | 0.03981 |
|
51 |
+
| Q5_K_S | 9791 | 6.0411 | 34.75 | 98.88 | 0.03886 |
|
52 |
+
| Q5_0 | 9817 | 6.0504 | 34.84 | 98.73 | 0.03895 |
|
53 |
+
| Q5_K_M | 10023 | 6.0389 | 35.57 | 98.92 | 0.03888 |
|
54 |
+
| Q5_1 | 10625 | 6.0366 | 37.71 | 98.96 | 0.03885 |
|
55 |
+
| Q6_K | 11564 | 6.0004 | 41.04 | 99.56 | 0.0386 |
|
56 |
+
| Q8_0 | 14975 | 5.9821 | 53.14 | 99.86 | 0.03842 |
|
57 |
+
| F16 | 28179 | 5.9737 | 100 | 100 | 0.03835 |
|
58 |
+
|
59 |
<hr>
|
60 |
|
61 |
# Qwen2.5-14B-Instruct
|