leafspark
/

DeepSeek-V2-Chat-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

leafspark commited on May 18

Commit

215480b

•

1 Parent(s): e362e9c

Update README.md

Files changed (1) hide show

README.md +7 -2

README.md CHANGED Viewed

@@ -20,5 +20,10 @@ Using llama.cpp fork: [https://github.com/fairydreaming/llama.cpp/tree/deepseek-
 - Merged GGUF should appear
 # Quants:
-- bf16
-- q8_0

 - Merged GGUF should appear
 # Quants:
+- bf16 (generating, 85% complete)
+- f16 (after q4_k_m, but just use bf16)
+- f32 (may require some time to upload, after q8_0)
+- q8_0 (after bf16)
+- q4_k_m (after q8_0)
+If quantize.exe supports it I will make RTN quants.