ikawrakow/qwen-14b-chat-gguf



	---
	license: apache-2.0
	---

	These are quantized Qwen-14B-Chat models in GGUF format for use with `llama.cpp`, posted in response to a user request.

	Note that the quantization used an importance matrix derived from English-only training data, so I have no idea how these models will perform in Chinese.
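
To illustrate why the calibration data matters, here is a toy sketch of importance-weighted quantization. It is not the algorithm `llama.cpp` uses, and nothing in it comes from these models: the helper names (`quantize_row`, `weighted_error`), the candidate-scale scan, and the synthetic `importance_en` / `importance_zh` vectors are all assumptions made up for the example. The idea it shows is only that a scale chosen to minimize error under one set of importance weights is not necessarily the best choice under a different set.

```python
import random


def weighted_error(weights, scale, q, importance):
    """Importance-weighted squared error of the dequantized row."""
    return sum(i * (w - scale * qi) ** 2
               for w, qi, i in zip(weights, q, importance))


def quantize_row(weights, importance, bits=4, n_candidates=64):
    """Scan candidate scales and keep the one with the lowest weighted error.

    Hypothetical helper: a single scale per row, signed integer levels.
    """
    qmax = 2 ** (bits - 1) - 1          # 7 for 4 bits
    qmin = -qmax - 1                    # -8 for 4 bits
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return 1.0, [0] * len(weights)

    best = None
    for k in range(1, n_candidates + 1):
        # Candidate scales around the naive max_abs / qmax choice.
        scale = max_abs / qmax * (0.5 + k / n_candidates)
        q = [max(qmin, min(qmax, round(w / scale))) for w in weights]
        err = weighted_error(weights, scale, q, importance)
        if best is None or err < best[0]:
            best = (err, scale, q)
    return best[1], best[2]


random.seed(0)
row = [random.gauss(0.0, 1.0) for _ in range(32)]

# Synthetic stand-ins for activation statistics from two calibration sets.
importance_en = [random.uniform(0.1, 10.0) for _ in row]
importance_zh = [random.uniform(0.1, 10.0) for _ in row]

scale_en, q_en = quantize_row(row, importance_en)
scale_zh, q_zh = quantize_row(row, importance_zh)

# Error measured with the second set's weights, for both quantizations.
err_mismatched = weighted_error(row, scale_en, q_en, importance_zh)
err_matched = weighted_error(row, scale_zh, q_zh, importance_zh)
print("tuned on set 1, scored on set 2:", err_mismatched)
print("tuned on set 2, scored on set 2:", err_matched)
```

Because both runs search the same candidate scales, the matched quantization can never score worse than the mismatched one on its own weights. How large the gap is for real models and real Chinese text is exactly what is unknown here.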