mgonzs13 commited on
Commit
2ebe3eb
1 Parent(s): 49680c3

Upload ggml-model-q4_k_m.gguf

Browse files

Following the new quantizations from llama.cpp (https://github.com/ggerganov/llama.cpp/tree/master/examples/quantize), I have created the q4_k_m.

Files changed (2) hide show
  1. .gitattributes +1 -0
  2. ggml-model-q4_k_m.gguf +3 -0
.gitattributes CHANGED
@@ -3,3 +3,4 @@ oid sha256:52470d09f21823288a37312cba875d7924a3ef02aa5b4a50832162d584acc68c
3
  size 2293
4
  ggml-model-q4_0.gguf filter=lfs diff=lfs merge=lfs -text
5
  ggml-model-f16.gguf filter=lfs diff=lfs merge=lfs -text
 
 
3
  size 2293
4
  ggml-model-q4_0.gguf filter=lfs diff=lfs merge=lfs -text
5
  ggml-model-f16.gguf filter=lfs diff=lfs merge=lfs -text
6
+ ggml-model-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text
ggml-model-q4_k_m.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:807cc82aa51dec5cde12f0aca35df78a139de519cb989ca6fb0d0e13e890c4bb
3
+ size 1708595680