Update README.md
README.md
CHANGED
@@ -7,7 +7,7 @@ inference: false
 
 <!-- Provide a quick summary of what the model is/does. -->
 
-**dragon-qwen-7b-gguf** is a quantized version of a fact-based question answering model, optimized for complex business documents, fine-tuned on top of
+**dragon-qwen-7b-gguf** is a quantized version of a fact-based question answering model, optimized for complex business documents, fine-tuned on top of Qwen2 7B base, and then packaged with 4_K_M GGUF quantization, providing a fast, small inference implementation for use on CPUs.
 
 
 To pull the model via API: