Krylova/mt5-small-finetuned-amazon-en-de

@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.6546
-- Rouge1: 18.2986
-- Rouge2: 10.9624
-- Rougel: 17.8943
-- Rougelsum: 18.0009
 ## Model description
@@ -46,16 +46,20 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 4
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
-| 3.3465        | 1.0   | 1301 | 2.7467          | 16.9837 | 10.688  | 16.7552 | 16.8412   |
-| 2.9974        | 2.0   | 2602 | 2.7116          | 17.3625 | 9.9383  | 17.2101 | 17.2771   |
-| 3.2834        | 3.0   | 3903 | 2.6592          | 17.8403 | 10.7668 | 17.6087 | 17.6695   |
-| 3.2139        | 4.0   | 5204 | 2.6546          | 18.2986 | 10.9624 | 17.8943 | 18.0009   |
 ### Framework versions

 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.5844
+- Rouge1: 18.6058
+- Rouge2: 10.0803
+- Rougel: 18.025
+- Rougelsum: 18.2237
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 8
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
+| 2.6887        | 1.0   | 1301  | 2.7862          | 20.3987 | 12.8512 | 19.7713 | 19.7397   |
+| 2.5315        | 2.0   | 2602  | 2.7636          | 19.7025 | 11.5086 | 19.2285 | 19.1621   |
+| 2.9455        | 3.0   | 3903  | 2.6457          | 20.5245 | 12.445  | 19.9432 | 19.9865   |
+| 2.9864        | 4.0   | 5204  | 2.5944          | 19.0345 | 10.3224 | 18.5022 | 18.5792   |
+| 2.9746        | 5.0   | 6505  | 2.5910          | 19.5747 | 10.3954 | 18.9401 | 19.1369   |
+| 2.9246        | 6.0   | 7806  | 2.5822          | 18.5846 | 9.8889  | 18.0374 | 18.2259   |
+| 2.8968        | 7.0   | 9107  | 2.5757          | 18.8335 | 10.2201 | 18.2386 | 18.4522   |
+| 2.8645        | 8.0   | 10408 | 2.5844          | 18.6058 | 10.0803 | 18.025  | 18.2237   |
 ### Framework versions
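
The updated card reports the metrics of the last epoch (epoch 8), although the new table shows a slightly lower validation loss at epoch 7 and a higher Rouge1 at epoch 3. A minimal sketch of that check, using only rows copied from the 8-epoch table; the variable names are illustrative and not part of the training script:

```python
# Rows copied from the 8-epoch "Training results" table in this diff:
# (epoch, step, validation_loss, rouge1)
rows = [
    (1, 1301, 2.7862, 20.3987),
    (2, 2602, 2.7636, 19.7025),
    (3, 3903, 2.6457, 20.5245),
    (4, 5204, 2.5944, 19.0345),
    (5, 6505, 2.5910, 19.5747),
    (6, 7806, 2.5822, 18.5846),
    (7, 9107, 2.5757, 18.8335),
    (8, 10408, 2.5844, 18.6058),
]

# The step column grows by a constant amount per epoch.
steps_per_epoch = rows[0][1]
assert all(step == epoch * steps_per_epoch for epoch, step, _, _ in rows)

best_loss = min(rows, key=lambda r: r[2])    # lowest validation loss
best_rouge1 = max(rows, key=lambda r: r[3])  # highest Rouge1
final = rows[-1]                             # what the card reports

print(f"steps per epoch:      {steps_per_epoch}")
print(f"lowest val loss:      epoch {best_loss[0]} ({best_loss[2]})")
print(f"highest Rouge1:       epoch {best_rouge1[0]} ({best_rouge1[3]})")
print(f"reported (last) row:  epoch {final[0]} ({final[2]}, {final[3]})")
```

Whether an earlier checkpoint would be preferable depends on which metric matters for the task; the diff itself does not say how the published weights were selected.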

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:821c15f836a825ed81bbb0b9d39f28f2413e0bd1625e44205d995b1e70bda918
 size 1200773058

 version https://git-lfs.github.com/spec/v1
+oid sha256:7ed3e89d4b71eb6211d1e7863bb78e4bdbc148e39d673093724969a32663c3e7
 size 1200773058

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fc3ac3d085a95d503b7d511d2d683cd83dda4647f8138e7203f59f20aed1caf5
 size 4664

 version https://git-lfs.github.com/spec/v1
+oid sha256:8e01c3a8b8531b9b8d88db32d1c0aa12565acd99dcd8aa23652ce885aba3c696
 size 4664
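
Both binary files are stored as Git LFS pointers: the `oid` changes while the `size` stays the same, which is consistent with retrained weights of an unchanged architecture. A minimal sketch of how a downloaded file could be checked against such a pointer; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of Git LFS or the Hugging Face tooling:

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the pointer's size and hash."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so large model files are not loaded at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# New-side pointer for training_args.bin, copied from this diff.
pointer = parse_lfs_pointer(
    """version https://git-lfs.github.com/spec/v1
oid sha256:8e01c3a8b8531b9b8d88db32d1c0aa12565acd99dcd8aa23652ce885aba3c696
size 4664
"""
)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("training_args.bin", pointer)` on a local copy would then confirm that the download corresponds to this commit, assuming the file sits at that path.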