Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sophiex
/
dpo_pythia1b_hh_rlhf.yml_local_29-04-24_13-31-33_xxxxx
like
0
PEFT
Safetensors
Generated from Trainer
Model card
Files
Files and versions
Community
Use this model
main
dpo_pythia1b_hh_rlhf.yml_local_29-04-24_13-31-33_xxxxx
/
special_tokens_map.json
Commit History
Training in progress, step 503
a07c5a1
verified
sophiex
commited on
Apr 29