simonbutt
/

am_llama3_dpo

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

am_llama3_dpo / README.md

simonbutt's picture

Update README.md

2b0ac10 verified 7 months ago

|

history blame contribute delete

800 Bytes

	---
	language:
	- en
	- am
	license: apache-2.0
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- llama
	- trl
	- sft
	base_model: unsloth/llama-3-8b-bnb-4bit
	datasets:
	- iocuydi/amharic-alpaca
	- iocuydi/amharic-dolly-15k
	---

	# Llama3 Amharic DPO

	[Amharic Llama3 8B Alpaca](simonbutt/am_llama3_alpaca) further DPO tuned on an amharic translated dolly-15k [dataset](https://huggingface.co/datasets/iocuydi/amharic-dolly-15k) to always respond in Amharic.

	Very token inefficient.

	- Developed by: simonbutt
	- License: apache-2.0
	- Finetuned from model:
	- unsloth/llama-3-8b-bnb-4bit
	- simonbutt/am_llama3_alpaca

	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)