--- library_name: transformers base_model: - UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3 datasets: - jondurbin/gutenberg-dpo-v0.1 license: gemma --- # gemma2-gutenberg-9B [UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3) finetuned on [jondurbin/gutenberg-dpo-v0.1](https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1). ### Method Finetuned using an RTX 4090 using ORPO for 3 epochs. [Fine-tune Llama 3 with ORPO](https://mlabonne.github.io/blog/posts/2024-04-19_Fine_tune_Llama_3_with_ORPO.html)