ugur6634/oco-ft-q4

ugur6634 committed on Dec 22, 2023

Commit 7d44638 • 1 Parent(s): d523747

Create README.md

This model is based on llama-2 chat-hf and was fine-tuned via QLoRA (PEFT), then quantized to q4k via llama.cpp. There is no noticeable performance loss.
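To give a feel for why 4-bit quantization can cost little accuracy, here is a minimal sketch of blockwise 4-bit quantization in plain Python. It is a simplified illustration, not llama.cpp's actual q4k format, and it says nothing about this model's measured quality. The function names, the block size of 32, and the synthetic Gaussian weights are all assumptions made for the example.

```python
import random


def quantize_block(values, bits=4):
    """Map one block of floats to unsigned integer codes plus a per-block scale and minimum."""
    levels = (1 << bits) - 1  # 15 distinct steps for 4 bits
    lo, hi = min(values), max(values)
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((v - lo) / scale) for v in values]
    return codes, scale, lo


def dequantize_block(codes, scale, lo):
    """Reconstruct approximate floats from the integer codes."""
    return [lo + c * scale for c in codes]


def roundtrip_rmse(weights, block_size=32):
    """Root-mean-square error after quantizing and dequantizing block by block."""
    sq_err = 0.0
    for start in range(0, len(weights), block_size):
        block = weights[start:start + block_size]
        codes, scale, lo = quantize_block(block)
        restored = dequantize_block(codes, scale, lo)
        sq_err += sum((a - b) ** 2 for a, b in zip(block, restored))
    return (sq_err / len(weights)) ** 0.5


# Synthetic stand-in for a weight tensor (hypothetical values, not from this model).
random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(4096)]
print(f"round-trip RMSE: {roundtrip_rmse(weights):.6f}")
```

Because each block stores its own scale and minimum, the rounding error per weight is bounded by half a quantization step of that block, which is why small blocks keep the reconstruction error low relative to the spread of the weights.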

Files changed (1)

README.md ADDED

File without changes