sagarsidhwa commited on
Commit
7c7fb03
1 Parent(s): b013681

V1 Training complete

Browse files
Files changed (1) hide show
  1. README.md +9 -7
README.md CHANGED
@@ -19,11 +19,11 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 3.3451
23
- - Rouge1: 12.9978
24
- - Rouge2: 5.3218
25
- - Rougel: 12.5685
26
- - Rougelsum: 12.5517
27
 
28
  ## Model description
29
 
@@ -48,13 +48,15 @@ The following hyperparameters were used during training:
48
  - seed: 42
49
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
51
- - num_epochs: 1
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
56
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|
57
- | 4.4084 | 1.0 | 968 | 3.3451 | 12.9978 | 5.3218 | 12.5685 | 12.5517 |
 
 
58
 
59
 
60
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 3.2107
23
+ - Rouge1: 16.5873
24
+ - Rouge2: 8.3667
25
+ - Rougel: 16.096
26
+ - Rougelsum: 16.0654
27
 
28
  ## Model description
29
 
 
48
  - seed: 42
49
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
51
+ - num_epochs: 3
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
56
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|
57
+ | 7.4326 | 1.0 | 968 | 3.3609 | 14.4654 | 5.3488 | 14.1032 | 14.1348 |
58
+ | 4.1082 | 2.0 | 1936 | 3.2265 | 15.9058 | 7.6084 | 15.3178 | 15.3304 |
59
+ | 3.8946 | 3.0 | 2904 | 3.2107 | 16.5873 | 8.3667 | 16.096 | 16.0654 |
60
 
61
 
62
  ### Framework versions