qa-indo-math-k

This model was trained from scratch on an unkown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss
No log	1.0	127	0.7652
No log	2.0	254	0.7520
No log	3.0	381	0.7681
0.9618	4.0	508	0.7337
0.9618	5.0	635	0.7560
0.9618	6.0	762	0.7397
0.9618	7.0	889	0.7298
0.6652	8.0	1016	0.7891
0.6652	9.0	1143	0.7874
0.6652	10.0	1270	0.7759
0.6652	11.0	1397	0.7505
0.6174	12.0	1524	0.7838
0.6174	13.0	1651	0.7878
0.6174	14.0	1778	0.8028
0.6174	15.0	1905	0.8154
0.5733	16.0	2032	0.8131
0.5733	17.0	2159	0.8278
0.5733	18.0	2286	0.8308
0.5733	19.0	2413	0.8433
0.5378	20.0	2540	0.8303
0.5378	21.0	2667	0.8352
0.5378	22.0	2794	0.8369
0.5378	23.0	2921	0.8518
0.5095	24.0	3048	0.8749
0.5095	25.0	3175	0.8533
0.5095	26.0	3302	0.8547
0.5095	27.0	3429	0.8844
0.4856	28.0	3556	0.8752
0.4856	29.0	3683	0.8804
0.4856	30.0	3810	0.8801