DPO fine-tuned models Family, high performance
-
jpacifico/Chocolatine-14B-Instruct-DPO-v1.2
Text Generation • Updated • 6.3k • 13 -
jpacifico/Chocolatine-3B-Instruct-DPO-Revised
Text Generation • Updated • 1.39k • 25 -
jpacifico/Chocolatine-3B-Instruct-DPO-v1.2
Text Generation • Updated • 2.65k • 6 -
jpacifico/Chocolatine-3B-Instruct-DPO-Revised-Q8_0-GGUF
Text Generation • Updated • 60 • 2