gpt_train_2_768 / train_results.json
Last commit: "End of training" (19d774d, verified) by gokulsrinivasagan
{
    "epoch": 0.03797345732829604,
    "total_flos": 2185295466332160.0,
    "train_loss": 8.552704480229592,
    "train_runtime": 93913.2173,
    "train_samples": 660643,
    "train_samples_per_second": 703.461,
    "train_steps_per_second": 10.992
}
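
The throughput fields above can be cross-checked against the runtime. The following is a minimal Python sketch, not part of the original file. It assumes that `train_runtime` is in seconds and that multiplying each per-second rate by the runtime gives the approximate totals processed. The derived names (`implied_samples`, `implied_steps`, `implied_samples_per_step`) are illustrative and do not appear in the results file.

```python
import json

# Metrics copied verbatim from the JSON file above.
RAW = """
{
    "epoch": 0.03797345732829604,
    "total_flos": 2185295466332160.0,
    "train_loss": 8.552704480229592,
    "train_runtime": 93913.2173,
    "train_samples": 660643,
    "train_samples_per_second": 703.461,
    "train_steps_per_second": 10.992
}
"""

metrics = json.loads(RAW)

# Assumption: train_runtime is measured in seconds.
runtime_s = metrics["train_runtime"]
runtime_hours = runtime_s / 3600

# Assumption: rate * runtime approximates the totals seen during training.
implied_samples = metrics["train_samples_per_second"] * runtime_s
implied_steps = metrics["train_steps_per_second"] * runtime_s

# Samples per optimizer step implied by the two rates (hypothetical derived value).
implied_samples_per_step = implied_samples / implied_steps

print(f"runtime: {runtime_hours:.2f} h")
print(f"implied samples processed: {implied_samples:,.0f}")
print(f"implied steps: {implied_steps:,.0f}")
print(f"implied samples per step: {implied_samples_per_step:.2f}")
```

Under these assumptions the run lasted roughly 26 hours and the two rates imply about 64 samples per step. Whether that corresponds to the configured batch size is not stated in the file.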