metadata
library_name: transformers
base_model:
- mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated
datasets:
- flammenai/casual-conversation-DPO
license: llama3
llama3.1-cc-8B
mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated finetuned on flammenai/casual-conversation-DPO.
This is an experimental finetune that formats the conversation data sequentially with the Llama 3 template.
Method
Finetuned using an A100 on Google Colab for 3 epochs.