Commit 649ad98 by ZhiguangHan (1 parent: 8645c7d)

End of training

README.md CHANGED
@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.8640
- - Rouge1: 0.4574
- - Rouge2: 0.1348
- - Rougel: 0.409
- - Rougelsum: 0.4095
+ - Loss: 1.6987
+ - Rouge1: 0.5051
+ - Rouge2: 0.1584
+ - Rougel: 0.46
+ - Rougelsum: 0.4594

  ## Model description

@@ -46,15 +46,17 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 3
+ - num_epochs: 5

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
- | 2.1853 | 1.0 | 500 | 1.9546 | 0.4401 | 0.1222 | 0.4091 | 0.4094 |
- | 2.0041 | 2.0 | 1000 | 1.8818 | 0.4543 | 0.131 | 0.4042 | 0.4044 |
- | 1.9697 | 3.0 | 1500 | 1.8640 | 0.4574 | 0.1348 | 0.409 | 0.4095 |
+ | 1.7426 | 1.0 | 500 | 1.8457 | 0.4597 | 0.1261 | 0.4121 | 0.4121 |
+ | 1.6948 | 2.0 | 1000 | 1.7994 | 0.4827 | 0.145 | 0.435 | 0.4347 |
+ | 1.7729 | 3.0 | 1500 | 1.7391 | 0.4949 | 0.1526 | 0.4522 | 0.4524 |
+ | 1.8046 | 4.0 | 2000 | 1.7093 | 0.5028 | 0.1547 | 0.4578 | 0.4576 |
+ | 1.7665 | 5.0 | 2500 | 1.6987 | 0.5051 | 0.1584 | 0.46 | 0.4594 |


  ### Framework versions
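
To make the size of the change in the README.md diff above easier to read, the final evaluation metrics from the removed and added lines can be subtracted directly. This is only a descriptive sketch: the names `before`, `after`, and `metric_deltas` are hypothetical, the numbers are copied from the diff, and the diff shows only `num_epochs` changing, so the deltas should not be attributed to any single cause.

```python
# Final evaluation metrics copied from the removed (-) and added (+) lines
# of the README.md diff. The dict and function names are illustrative only.
before = {"Loss": 1.8640, "Rouge1": 0.4574, "Rouge2": 0.1348,
          "Rougel": 0.409, "Rougelsum": 0.4095}
after = {"Loss": 1.6987, "Rouge1": 0.5051, "Rouge2": 0.1584,
         "Rougel": 0.46, "Rougelsum": 0.4594}


def metric_deltas(old, new):
    """Return new - old for every metric, rounded to 4 decimals."""
    return {name: round(new[name] - old[name], 4) for name in old}


deltas = metric_deltas(before, after)
for name, delta in deltas.items():
    # Negative is better for Loss; positive is better for the ROUGE scores.
    print(f"{name}: {delta:+.4f}")
```

Lower is better for the loss, while higher is better for the ROUGE scores, so the two kinds of delta are expected to have opposite signs when a run improves.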
runs/Dec07_06-15-20_ef14320b1293/events.out.tfevents.1701929728.ef14320b1293.9784.2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:6184aaf49ef76165b5c5e96b80a266cbf0c32dc08e41c9e0ea8214e35a4d143d
- size 7211
+ oid sha256:147b238498f93cd58fec996fc86499d9729a9fe0272642e51e8bd9bd0f492394
+ size 8039
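
The TensorBoard event file is stored through Git LFS, so the diff above changes only the three-line pointer file, not the binary itself. As a minimal sketch of how such a pointer can be read, the snippet below parses the `version`, `oid`, and `size` lines. The function name `parse_lfs_pointer` and the returned dictionary keys are hypothetical, and the parser assumes one space-separated key/value pair per line, as in the pointer shown here.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the added (+) side of the diff.
new_pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:147b238498f93cd58fec996fc86499d9729a9fe0272642e51e8bd9bd0f492394
size 8039
"""

info = parse_lfs_pointer(new_pointer)
old_size = 7211  # size from the removed (-) side of the diff
print(info["hash_algo"], info["size"], info["size"] - old_size)
```

The size difference reflects the event log growing from 7211 to 8039 bytes between the two commits.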