Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
AY2324S2-CS4248-Team-47
/
StableLM-DPO-Ultrafeedback
like
0
Follow
AY2324S2-CS4248-Team-47
5
PEFT
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Use this model
main
StableLM-DPO-Ultrafeedback
/
optimizer.pt
Commit History
Fix: upload best checkpoint
b744dd9
JayanthB
commited on
Apr 17
Init DPO on Ultrafeedback dataset
3be625b
JayanthB
commited on
Apr 17