eren23
/

DPOMixLLama-3-8B-lora

Text Generation

text-generation-inference

Model card Files Files and versions Community

Edit model card

A finetuning experiment on llama3 8b it with selected 5k examples from argilla dpo 7k

Downloads last month: 0

Model tree for eren23/DPOMixLLama-3-8B-lora

Base model

meta-llama/Meta-Llama-3-8B-Instruct

Adapter

(522)

this model

Dataset used to train eren23/DPOMixLLama-3-8B-lora