llmware
/

dragon-qwen-7b-gguf

Model card Files Files and versions Community

doberst commited on Aug 22

Commit

5e3037e

•

1 Parent(s): 14796a6

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ inference: false
 <!-- Provide a quick summary of what the model is/does. -->
-**dragon-qwen-7b-gguf** is a quantized version of a fact-based question answering model, optimized for complex business documents, fine-tuned on top of Qwen 7B base, and then packaged with 4_K_M GGUF quantization, providing a fast, small inference implementation for use on CPUs.
 To pull the model via API:

 <!-- Provide a quick summary of what the model is/does. -->
+**dragon-qwen-7b-gguf** is a quantized version of a fact-based question answering model, optimized for complex business documents, fine-tuned on top of Qwen2 7B base, and then packaged with 4_K_M GGUF quantization, providing a fast, small inference implementation for use on CPUs.
 To pull the model via API: