Model Card for free-llama-dpo-v0.2
Developed by : Freewheelin AI Technical Team
Hardware and Software
- Training Factors: We fine-tuned this model using the HuggingFace TRL Trainer
Method
- This model was trained using the learning method introduced in the SOLAR paper.
- Downloads last month
- 4,326
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.