Commit 01d79eb (parent: 206dd29), committed by InferenceIllusionist

Update README.md
README.md CHANGED
@@ -11,6 +11,8 @@ tags:
 
 Uses the same imat calculation method as the later batch of maid-yuzu-v8-alter-iMat-GGUF.
 
+<b>Legacy quants (e.g. Q5_K_M, Q6_K, etc.) in this repo have all been enhanced with the imatrix calculation. No need for two separate repos.</b>
+
 For more information on the latest iMatrix quants, see this PR - https://github.com/ggerganov/llama.cpp/pull/5747
 
 <b>Tip:</b> The letter at the end of the quant name indicates its size. Larger sizes have better quality; smaller sizes are faster.
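
For context on what "enhanced with the imatrix calculation" means here, below is a minimal sketch of the usual llama.cpp workflow, under stated assumptions: an importance matrix is computed once from a calibration text with the imatrix tool and then passed to the quantize tool via `--imatrix`, so the same matrix can be reused for every output type in the repo, including Q5_K_M and Q6_K. All file names and the calibration corpus are placeholders, the binaries are assumed to be in the current directory, and the tool names reflect llama.cpp builds of this era (newer builds rename them `llama-imatrix` and `llama-quantize`).

```python
import subprocess

# Placeholder inputs -- substitute your own fp16 GGUF and calibration text.
FP16_MODEL = "model-f16.gguf"
CALIBRATION_TEXT = "calibration.txt"  # any representative plain-text corpus
IMATRIX_FILE = "imatrix.dat"

# Step 1: compute the importance matrix from the calibration text
# (llama.cpp's imatrix tool, assumed to be built in the current directory).
subprocess.run(
    ["./imatrix", "-m", FP16_MODEL, "-f", CALIBRATION_TEXT, "-o", IMATRIX_FILE],
    check=True,
)

# Step 2: quantize with the importance matrix applied, reusing the same
# imatrix file for each quant type named in the README.
for quant_type in ["Q5_K_M", "Q6_K"]:
    subprocess.run(
        [
            "./quantize",
            "--imatrix", IMATRIX_FILE,
            FP16_MODEL,
            f"model-{quant_type}.gguf",
            quant_type,
        ],
        check=True,
    )
```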