Uploaded model
- Developed by: OsakanaTeishoku
- License: cc-by-nc-nd-4.0
- Finetuned from model : weblab-GENIAC/Tanuki-8B-dpo-v1.0
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
Model tree for OsakanaTeishoku/Tanuki-8B-dpo-v1.0-ogiri-adapter
Base model
weblab-GENIAC/Tanuki-8B-dpo-v1.0