ugur6634 committed on
Commit 7d44638 (1 parent: d523747)

Create README.md

This model is based on Llama-2 chat-hf and was fine-tuned with QLoRA (PEFT), then quantized to Q4_K with llama.cpp. There is no noticeable performance loss.

Files changed (1)
  1. README.md +0 -0
README.md ADDED
File without changes