MedQA_L3_350steps_1e7rate_01beta_CSFTDPO / model-00002-of-00004.safetensors

Commit History

End of training
1837a30
verified

tsavage68 commited on