scaling / c4_original-d=1024_l=24_h=8-0.25

Commit History

overtraining model release
6ade3a7

sagadre commited on