shirwu
/

content

Generated from Trainer

Model card Files Files and versions Community

Commit History

shirwu/dpo-personal-preference-llama3.2-1b-trainer

e0257e3
verified

shirwu commited on Oct 19

initial commit

10f9c15
verified

shirwu commited on Oct 19