celsowm
/

auryn_dpo_orpo_english

Model card Files Files and versions Community

auryn_dpo_orpo_english / README.md

celsowm's picture

Update README.md

a14140c verified 20 days ago

|

history blame contribute delete

555 Bytes

	---
	license: apache-2.0
	datasets:
	- celsowm/auryn_dpo_orpo_english
	language:
	- en
	base_model:
	- meta-llama/Llama-3.2-1B
	tags:
	- orpo
	---
	# auryn_dpo_orpo_english

	This is a ORPO fine-tune of meta-llama/Llama-3.2-1b trained on three epochs of https://huggingface.co/datasets/celsowm/auryn_dpo_orpo_english

	Auryn is a fictional place intended to serve as a proof of concept for injecting knowledge into a large language model using ORPO.

	Tutorial here: https://medium.com/@celsoaf/injecting-new-knowledge-into-an-llm-via-fine-tuning-with-orpo-017d3bfdb11b