metadata
language:
- en
- am
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
- sft
base_model: unsloth/llama-3-8b-bnb-4bit
datasets:
- iocuydi/amharic-alpaca
- iocuydi/amharic-dolly-15k
Llama3 Amharic DPO
Amharic Llama3 8B Alpaca further DPO tuned on an amharic translated dolly-15k dataset to always respond in Amharic.
Very token inefficient.
- Developed by: simonbutt
- License: apache-2.0
- Finetuned from model:
- unsloth/llama-3-8b-bnb-4bit
- simonbutt/am_llama3_alpaca