Upload Minisun: trained using model.fit on NeelNanda/pile-10k[0-5000], lr 1e-4, cw 128, 2 epochs, batch size 8, cosine with restarts
2418aff
finnstrom3693
committed on