Committed by bullerwins
Commit ddb996c · 1 parent: 4556979
Update README.md
README.md
CHANGED
@@ -7,6 +7,8 @@ license_link: LICENSE
 <!-- markdownlint-disable html -->
 <!-- markdownlint-disable no-duplicate-header -->
 
+NOTE: You might need to disable FA (Flash Attention) in llama.cpp to work properly.
+
 GGUF quantize version of [DeepSeek-Coder-V2-Instruct-GGUF](https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Instruct)
 
 Using [llama.cpp c637fcd](https://github.com/ggerganov/llama.cpp/commit/c637fcd34d135a9ff4f97d3a53ad03a910a4a31f)
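
The added NOTE does not say how to disable Flash Attention. As a rough sketch only, and assuming the `-fa` / `--flash-attn` switch is a plain on/off flag (as it appears to be in llama.cpp builds from around the referenced commit; newer builds may expect a value instead), one way is to make sure the flag is not passed at all. The binary name `./llama-cli`, the file name `model.gguf`, and the helper `without_flash_attention` below are hypothetical placeholders, not part of this repository.

```python
# Assumption: "-fa" / "--flash-attn" are boolean switches, as in llama.cpp
# builds from around commit c637fcd. Newer builds may expect a value instead.
FLASH_ATTN_FLAGS = {"-fa", "--flash-attn"}


def without_flash_attention(args):
    """Return a copy of args with any flash-attention switch removed."""
    return [arg for arg in args if arg not in FLASH_ATTN_FLAGS]


# Hypothetical command line; binary and model file names are placeholders.
command = ["./llama-cli", "-m", "model.gguf", "-fa", "-p", "Hello"]
safe_command = without_flash_attention(command)
print(" ".join(safe_command))
```

The filtered list can then be handed to `subprocess.run` or joined into a shell command. If a build treats the flag differently, check its `--help` output before relying on this.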