Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Andyrasika
's Collections
computation
Fine-Tuning
Ankush Collection
RAG articles
multimodal
Time series
Audio
Reinforcement Learning
Transformers
Stable Diffusion
cool models
Synthetic Datasets
Fine-Tuning
updated
12 days ago
Fine-Tuning
Upvote
1
Direct Judgement Preference Optimization
Paper
•
2409.14664
•
Published
Sep 23
Upvote
1
Share collection
View history
Collection guide
Browse collections