santyzenith commited on
Commit
76e30f3
1 Parent(s): 5f01788

End of training

Browse files
Files changed (1) hide show
  1. README.md +20 -20
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
- license: mit
3
- base_model: flax-community/spanish-t5-small
4
  tags:
5
  - generated_from_trainer
6
  metrics:
@@ -15,14 +15,14 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  # augmented_t5_pictos
17
 
18
- This model is a fine-tuned version of [flax-community/spanish-t5-small](https://huggingface.co/flax-community/spanish-t5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.4052
21
- - Rouge1: 42.0934
22
- - Rouge2: 28.3804
23
- - Rougel: 41.2489
24
- - Rougelsum: 41.3148
25
- - Gen Len: 8.4979
26
 
27
  ## Model description
28
 
@@ -41,7 +41,7 @@ More information needed
41
  ### Training hyperparameters
42
 
43
  The following hyperparameters were used during training:
44
- - learning_rate: 2e-05
45
  - train_batch_size: 8
46
  - eval_batch_size: 8
47
  - seed: 42
@@ -53,16 +53,16 @@ The following hyperparameters were used during training:
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
55
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
56
- | 2.3625 | 1.0 | 527 | 2.0249 | 34.7282 | 19.5441 | 33.9362 | 33.9053 | 7.6303 |
57
- | 1.9453 | 2.0 | 1054 | 1.7992 | 37.5587 | 22.5669 | 36.6904 | 36.7767 | 8.2671 |
58
- | 1.7429 | 3.0 | 1581 | 1.6655 | 39.7178 | 24.1301 | 38.8293 | 38.8615 | 8.3462 |
59
- | 1.6306 | 4.0 | 2108 | 1.5791 | 40.8591 | 26.0948 | 40.088 | 40.0812 | 7.9573 |
60
- | 1.461 | 5.0 | 2635 | 1.5204 | 40.6998 | 25.5825 | 39.8972 | 39.9642 | 8.3568 |
61
- | 1.4178 | 6.0 | 3162 | 1.4702 | 41.0675 | 26.4585 | 40.1879 | 40.2333 | 8.4679 |
62
- | 1.3394 | 7.0 | 3689 | 1.4452 | 41.9649 | 27.1925 | 41.0945 | 41.1233 | 8.2329 |
63
- | 1.2844 | 8.0 | 4216 | 1.4210 | 41.9633 | 27.7102 | 40.9804 | 41.0706 | 8.5406 |
64
- | 1.2151 | 9.0 | 4743 | 1.4072 | 41.9336 | 28.0917 | 41.0597 | 41.1328 | 8.4722 |
65
- | 1.215 | 10.0 | 5270 | 1.4052 | 42.0934 | 28.3804 | 41.2489 | 41.3148 | 8.4979 |
66
 
67
 
68
  ### Framework versions
 
1
  ---
2
+ license: apache-2.0
3
+ base_model: vgaraujov/t5-base-spanish
4
  tags:
5
  - generated_from_trainer
6
  metrics:
 
15
 
16
  # augmented_t5_pictos
17
 
18
+ This model is a fine-tuned version of [vgaraujov/t5-base-spanish](https://huggingface.co/vgaraujov/t5-base-spanish) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 1.2659
21
+ - Rouge1: 45.6149
22
+ - Rouge2: 30.0038
23
+ - Rougel: 44.5481
24
+ - Rougelsum: 44.5275
25
+ - Gen Len: 7.7863
26
 
27
  ## Model description
28
 
 
41
  ### Training hyperparameters
42
 
43
  The following hyperparameters were used during training:
44
+ - learning_rate: 3e-05
45
  - train_batch_size: 8
46
  - eval_batch_size: 8
47
  - seed: 42
 
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
55
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
56
+ | 2.4139 | 1.0 | 527 | 1.9162 | 36.3092 | 19.4412 | 35.3042 | 35.2912 | 6.6688 |
57
+ | 1.8071 | 2.0 | 1054 | 1.6680 | 42.3979 | 26.0796 | 41.4105 | 41.3967 | 7.0363 |
58
+ | 1.5205 | 3.0 | 1581 | 1.5088 | 43.033 | 27.2079 | 42.0521 | 42.0524 | 7.6261 |
59
+ | 1.3732 | 4.0 | 2108 | 1.4220 | 44.7554 | 28.8592 | 43.8614 | 43.8616 | 7.3504 |
60
+ | 1.2345 | 5.0 | 2635 | 1.3596 | 45.1026 | 29.8953 | 44.055 | 44.0191 | 7.5791 |
61
+ | 1.1474 | 6.0 | 3162 | 1.3102 | 44.9068 | 29.1063 | 43.7844 | 43.7561 | 7.7521 |
62
+ | 1.0654 | 7.0 | 3689 | 1.2925 | 45.5775 | 29.9282 | 44.5345 | 44.5368 | 7.6090 |
63
+ | 0.9925 | 8.0 | 4216 | 1.2732 | 45.3677 | 29.7696 | 44.3156 | 44.2848 | 7.7585 |
64
+ | 0.9616 | 9.0 | 4743 | 1.2663 | 45.6594 | 30.2393 | 44.6785 | 44.7119 | 7.8419 |
65
+ | 0.9296 | 10.0 | 5270 | 1.2659 | 45.6149 | 30.0038 | 44.5481 | 44.5275 | 7.7863 |
66
 
67
 
68
  ### Framework versions