ugur6634 committed on
Commit
3932c5d
1 Parent(s): 7d44638

Create README.md


This model is based on llama-2 chat-hf and was fine-tuned via QLoRA (PEFT). It was then quantized to q4k via llama.cpp. There is no noticeable performance loss.
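
To give a rough feel for why 4-bit quantization can leave quality largely intact, here is a minimal sketch of block-wise 4-bit quantization in plain Python. It is a simplified illustration and not llama.cpp's actual q4k format. The block size of 32, the random test weights, and all function names are assumptions made for this example.

```python
import random

BLOCK_SIZE = 32  # assumption: weights are quantized in small independent blocks
LEVELS = 15      # 4 bits give 16 codes, numbered 0..15


def quantize_block(block):
    """Map the floats in one block to 4-bit codes plus a scale and a minimum."""
    lo, hi = min(block), max(block)
    # Fall back to 1.0 for a constant block to avoid dividing by zero.
    scale = (hi - lo) / LEVELS or 1.0
    codes = [round((w - lo) / scale) for w in block]
    return codes, scale, lo


def dequantize_block(codes, scale, lo):
    """Reconstruct approximate floats from the 4-bit codes."""
    return [lo + c * scale for c in codes]


def roundtrip(weights):
    """Quantize and immediately dequantize a flat list of weights, block by block."""
    restored = []
    for start in range(0, len(weights), BLOCK_SIZE):
        codes, scale, lo = quantize_block(weights[start:start + BLOCK_SIZE])
        restored.extend(dequantize_block(codes, scale, lo))
    return restored


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical weights: small Gaussian values, not taken from the real model.
    weights = [rng.gauss(0.0, 0.02) for _ in range(1024)]
    restored = roundtrip(weights)
    max_err = max(abs(a - b) for a, b in zip(weights, restored))
    print(f"max absolute error: {max_err:.6f}")
```

Because each block stores its own scale and minimum, the rounding error of any weight is bounded by half of that block's step size, which is why small blocks keep the reconstruction close to the original values.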
