NuminaMath-7B-CoT / train_results.json
lewtun HF staff
Add AI-MO/deepseek-math-7b-sft-aimo_v31.24 checkpoint
4c9f488 verified
236 Bytes
{
"epoch": 3.0,
"total_flos": 2049636776804352.0,
"train_loss": 0.42822988295175696,
"train_runtime": 30718.8165,
"train_samples": 863474,
"train_samples_per_second": 21.729,
"train_steps_per_second": 0.679
}
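
The fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the metrics follow the usual Hugging Face `Trainer` convention in which `train_samples_per_second` and `train_steps_per_second` are totals divided by `train_runtime` (in seconds); the file itself does not state this. The helper name `derived_metrics` and its output keys are hypothetical, introduced only for illustration.

```python
import json

# Values copied verbatim from train_results.json.
RESULTS_JSON = """
{
  "epoch": 3.0,
  "total_flos": 2049636776804352.0,
  "train_loss": 0.42822988295175696,
  "train_runtime": 30718.8165,
  "train_samples": 863474,
  "train_samples_per_second": 21.729,
  "train_steps_per_second": 0.679
}
"""


def derived_metrics(results):
    """Derive a few sanity-check quantities (hypothetical helper).

    Assumes train_runtime is in seconds and that both throughput
    figures were computed over that same runtime.
    """
    runtime_s = results["train_runtime"]
    return {
        # Wall-clock training time in hours.
        "runtime_hours": runtime_s / 3600,
        # Samples consumed per optimizer step (effective batch size),
        # approximate because both inputs are rounded to 3 decimals.
        "samples_per_step": results["train_samples_per_second"]
        / results["train_steps_per_second"],
        # Approximate number of optimizer steps over the whole run.
        "approx_total_steps": results["train_steps_per_second"] * runtime_s,
    }


if __name__ == "__main__":
    results = json.loads(RESULTS_JSON)
    for name, value in derived_metrics(results).items():
        print(f"{name}: {value:.3f}")
```

Under these assumptions the run took roughly 8.5 hours at an effective batch of about 32 samples per step. Note that `train_samples` times `epoch` divided by `train_runtime` does not reproduce `train_samples_per_second`; one possible explanation is that `train_samples` was counted before a preprocessing step such as sequence packing, but the file does not confirm this.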