gsvann/mt5-small-finetuned-amazon-en-de

@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.6964
-- Rouge1: 16.9495
-- Rouge2: 10.3864
-- Rougel: 16.6107
-- Rougelsum: 16.5146
+- Loss: 2.6132
+- Rouge1: 18.5886
+- Rouge2: 10.4761
+- Rougel: 17.9705
+- Rougelsum: 18.2467
 ## Model description
@@ -46,20 +46,18 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 8
+- num_epochs: 6
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
-| 8.2166        | 1.0   | 651  | 3.1449          | 14.8158 | 7.6238  | 14.4691 | 14.317    |
-| 4.0965        | 2.0   | 1302 | 2.9073          | 14.7988 | 7.7674  | 14.5117 | 14.3895   |
-| 3.7254        | 3.0   | 1953 | 2.8142          | 14.5259 | 7.1771  | 14.3574 | 14.3033   |
-| 3.5485        | 4.0   | 2604 | 2.7675          | 16.6534 | 9.6695  | 16.5785 | 16.3698   |
-| 3.42          | 5.0   | 3255 | 2.7387          | 16.6817 | 9.9656  | 16.5414 | 16.414    |
-| 3.3369        | 6.0   | 3906 | 2.7054          | 17.2119 | 10.4457 | 17.0202 | 16.8504   |
-| 3.3025        | 7.0   | 4557 | 2.6915          | 16.2837 | 10.02   | 15.9801 | 15.8678   |
-| 3.2659        | 8.0   | 5208 | 2.6964          | 16.9495 | 10.3864 | 16.6107 | 16.5146   |
+| 3.0822        | 1.0   | 651  | 2.6232          | 17.8834 | 11.2967 | 17.4489 | 17.73     |
+| 2.9301        | 2.0   | 1302 | 2.6265          | 18.3784 | 11.1587 | 18.0152 | 18.1896   |
+| 2.8552        | 3.0   | 1953 | 2.6223          | 18.7889 | 11.1004 | 18.2222 | 18.4583   |
+| 2.8253        | 4.0   | 2604 | 2.6080          | 18.2031 | 10.2714 | 17.748  | 17.8395   |
+| 2.8044        | 5.0   | 3255 | 2.6154          | 18.5675 | 10.5794 | 17.967  | 18.2698   |
+| 2.7928        | 6.0   | 3906 | 2.6132          | 18.5886 | 10.4761 | 17.9705 | 18.2467   |
 ### Framework versions

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0e0c7b22a15e5d373d9c684f73006712f62edefda09cf19a161264c0f1248f08
+oid sha256:dc66756f726ca2bb65ee3f851abac731eb68f8e31657faf0c0ce42470aa45354
 size 1200773058

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bf2b2b343ad9dd4f0aa666d21176f73443fa1a6eb7c4fcaa5df5de460072bc11
+oid sha256:0c3bf9535f955a89e82e4cc1478fe509fc4302a1aa4ff898439dda5668decf5e
 size 4664
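
The two `.bin` entries above are Git LFS pointer files: each records a SHA-256 `oid` and a byte `size` for the real binary, so only the `oid` changes when the weights are retrained to the same size. The following is a minimal sketch of how a downloaded file could be checked against such a pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of Git LFS or any Hugging Face library; the pointer text is the new `training_args.bin` pointer from this commit.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and SHA-256 digest in `pointer`."""
    path = Path(path)
    # Cheap check first: a size mismatch rules the file out without hashing it.
    if path.stat().st_size != pointer["size"]:
        return False
    sha = hashlib.sha256()
    with path.open("rb") as f:
        # Hash in 1 MiB chunks so large weight files are never fully loaded into memory.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            sha.update(chunk)
    return sha.hexdigest() == pointer["digest"]


# Pointer contents for the new training_args.bin, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:0c3bf9535f955a89e82e4cc1478fe509fc4302a1aa4ff898439dda5668decf5e\n"
    "size 4664\n"
)
```

Assuming a local copy of the file exists, `matches_pointer("training_args.bin", pointer)` would report whether it corresponds to this commit; the local path is an assumption for illustration.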