am_llama3_dpo / README.md
simonbutt's picture
Update README.md
2b0ac10 verified
metadata
language:
  - en
  - am
license: apache-2.0
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - llama
  - trl
  - sft
base_model: unsloth/llama-3-8b-bnb-4bit
datasets:
  - iocuydi/amharic-alpaca
  - iocuydi/amharic-dolly-15k

Llama3 Amharic DPO

Amharic Llama3 8B Alpaca further DPO tuned on an amharic translated dolly-15k dataset to always respond in Amharic.

Very token inefficient.

  • Developed by: simonbutt
  • License: apache-2.0
  • Finetuned from model:
    • unsloth/llama-3-8b-bnb-4bit
    • simonbutt/am_llama3_alpaca