patrixtano committed on
Commit
ed2eb4a
1 Parent(s): 3175eaa

End of training

Files changed (1)
  1. README.md +10 -9
README.md CHANGED
@@ -16,9 +16,11 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: nan
-- Exact Match: 0.0
-- Gen Len: 0.0
+- Loss: 0.5256
+- Score: 43.6341
+- Char Order: 6
+- Word Order: 0
+- Beta: 2
 
 ## Model description
 
@@ -44,15 +46,14 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
-- mixed_precision_training: Native AMP
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Exact Match | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:-----------:|:-------:|
-| 0.0 | 1.0 | 2105 | nan | 0.0 | 0.0 |
-| 0.0 | 2.0 | 4210 | nan | 0.0 | 0.0 |
-| 0.0 | 3.0 | 6315 | nan | 0.0 | 0.0 |
+| Training Loss | Epoch | Step | Validation Loss | Score | Char Order | Word Order | Beta |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:----------:|:----------:|:----:|
+| 1.2999 | 1.0 | 2105 | 0.7759 | 36.9398 | 6 | 0 | 2 |
+| 0.87 | 2.0 | 4210 | 0.5735 | 41.0183 | 6 | 0 | 2 |
+| 0.7796 | 3.0 | 6315 | 0.5256 | 43.6341 | 6 | 0 | 2 |
 
 
 ### Framework versions
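
Note on the new metric columns: Score, Char Order, Word Order and Beta are the keys that the chrF metric reports in sacreBLEU and in the Hugging Face `evaluate` wrapper, so the updated README was probably evaluated with chrF. The commit itself does not say so, so treat this as an assumption. Under that reading, Char Order, Word Order and Beta are configuration values, not measurements, which would explain why they stay at 6, 0 and 2 in every row. The sketch below is a simplified, pure-Python illustration of how such a score combines character n-gram precision and recall. The function names `char_ngrams` and `simple_chrf` are hypothetical, and the result will not match sacreBLEU exactly.

```python
from collections import Counter


def char_ngrams(text: str, n: int) -> Counter:
    """Count character n-grams after removing whitespace (a simplification)."""
    chars = "".join(text.split())
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def simple_chrf(hypothesis: str, reference: str,
                char_order: int = 6, beta: float = 2.0) -> float:
    """Simplified sentence-level chrF on a 0-100 scale (illustrative only).

    char_order=6 and beta=2 mirror the values listed in the README;
    word n-grams are ignored, which corresponds to word_order=0.
    """
    precisions, recalls = [], []
    for n in range(1, char_order + 1):
        hyp = char_ngrams(hypothesis, n)
        ref = char_ngrams(reference, n)
        if not hyp or not ref:
            continue  # skip orders longer than either string
        overlap = sum((hyp & ref).values())  # clipped n-gram matches
        precisions.append(overlap / sum(hyp.values()))
        recalls.append(overlap / sum(ref.values()))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    r = sum(recalls) / len(recalls)
    if p + r == 0:
        return 0.0
    # F-beta: beta=2 weights recall four times as heavily as precision.
    return 100 * (1 + beta**2) * p * r / (beta**2 * p + r)


if __name__ == "__main__":
    print(simple_chrf("the cat", "the cat sat"))
```

Because beta is 2, a hypothesis that omits part of the reference (low recall) is penalized more than one that adds extra text (low precision).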