DavidAU commited on
Commit
1d98bca
1 Parent(s): e678b0c

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +106 -0
README.md ADDED
@@ -0,0 +1,106 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - story
7
+ - general usage
8
+ - ultra high precision
9
+ ---
10
+ <B>NEO CLASS Ultra "X" Quants for : TinyLlama-1.1B-Chat-v1.0-Ultra-NEO-V1-Imatrix-GGUF</B>
11
+
12
+ The NEO Class tech was created after countless investigations and over 120 lab experiments backed by
13
+ real world testing and qualitative results.
14
+
15
+ <b>NEO Class results: </b>
16
+
17
+ Better overall function, instruction following, output quality and stronger connections to ideas, concepts and the world in general.
18
+
19
+ In addition quants now operate above their "grade" so to speak :
20
+
21
+ IE: IQ4 operate at Q5KM/Q6 levels.
22
+
23
+ Perplexity drop of 591 points for Neo Class Imatrix quant of IQ4XS VS regular quant of IQ4XS.
24
+
25
+ (lower is better)
26
+
27
+ <B> What are "X" Quants? </B>
28
+
29
+ The "X" quants in this repo are quants at IQ4XS which have been modified at the time of quanting.
30
+
31
+ There are examples of output below from each "X" quant with give you a rough idea of differences between them.
32
+
33
+ This is a guide only.
34
+
35
+ Although "TinyLlama" is a capable model, it is limited and therefore there will be
36
+ limited variations between "X" quants, Neo Imatrix Quants and standard quants.
37
+
38
+ Other models of higher parameter counts show much stronger differences as well as capabilities.
39
+
40
+ In addition at this repo there is a "regular non-NEO/non X quant" and an Ultra Neo non "X quant"
41
+ for usage and/or comparison purposes.
42
+
43
+ Because "X" quants operate slightly differently than standard quants I suggest you download a number
44
+ of them for testing as they also differ in function between themselves too.
45
+
46
+ There are 11 "X" quants in this repo, and denoted by a four digit number (IE "0001")
47
+ at the end of the file name.
48
+
49
+ For testing it is suggested to use 3 "no right answer" prompts and 3 standard limited answer prompts
50
+ related to your use case(s) with a setting "temp=0" to allow consistent testing.
51
+
52
+ For Ultra NEO quants (all quants) of this model please go here:
53
+
54
+ [ https://huggingface.co/DavidAU/TinyLlama-1.1B-Chat-v1.0-Ultra-NEO-V1-Imatrix-GGUF ]
55
+
56
+ <B> Model Notes: </B>
57
+
58
+ Maximum context is 2k. Please see original model maker's page for details, and usage information for this model.
59
+
60
+ Special thanks to the model creators at TinyLLama for making such a fantastic model:
61
+
62
+ [ https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0 ]
63
+
64
+ <h3>EXAMPLES:</h3>
65
+
66
+ <font color="red"> TEST PROMPT (no right answer): Give me 3 fictional reasons the Earth's sun went supernova, in vivid and exacting detail of 500 words EACH PER REASON including details of what happens when the sun goes supernova. </font>
67
+
68
+ <B>Standard non alternated IQ4XS</b>
69
+
70
+ <B>Imatrix NEO IQ4XS</b>
71
+
72
+
73
+ <B>Imatrix NEO X Quant IQ4XS "0001"</b>
74
+
75
+
76
+ <B>Imatrix NEO X Quant IQ4XS "0001"</b>
77
+
78
+
79
+
80
+ <B>Imatrix NEO X Quant IQ4XS "0001"</b>
81
+
82
+
83
+ <B>Imatrix NEO X Quant IQ4XS "0001"</b>
84
+
85
+
86
+ <B>Imatrix NEO X Quant IQ4XS "0100"</b>
87
+
88
+
89
+ <B>Imatrix NEO X Quant IQ4XS "0101"</b>
90
+
91
+
92
+ <B>Imatrix NEO X Quant IQ4XS "0102"</b>
93
+
94
+
95
+ <B>Imatrix NEO X Quant IQ4XS "0200"</b>
96
+
97
+
98
+ <B>Imatrix NEO X Quant IQ4XS "0201"</b>
99
+
100
+
101
+ <B>Imatrix NEO X Quant IQ4XS "0202"</b>
102
+
103
+
104
+ <B>Imatrix NEO X Quant IQ4XS "0203"</b>
105
+
106
+