Training complete

Files changed (4)

README.md CHANGED

@@ -19,11 +19,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.0193
-- Rouge1: 17.1639
-- Rouge2: 8.3005
-- Rougel: 16.8294
-- Rougelsum: 16.8084
+- Loss: 3.0303
+- Rouge1: 16.6072
+- Rouge2: 7.5336
+- Rougel: 16.1402
+- Rougelsum: 16.1129
 ## Model description
@@ -54,14 +54,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|
-| 3.6768        | 1.0   | 1209 | 3.2182          | 17.7771 | 9.2416 | 17.1971 | 17.2872   |
-| 3.6447        | 2.0   | 2418 | 3.1029          | 17.4703 | 8.7095 | 16.9938 | 16.9346   |
-| 3.4304        | 3.0   | 3627 | 3.0759          | 15.8554 | 7.5643 | 15.2637 | 15.2583   |
-| 3.3128        | 4.0   | 4836 | 3.0706          | 17.0706 | 8.7201 | 16.7226 | 16.6156   |
-| 3.2203        | 5.0   | 6045 | 3.0339          | 16.5228 | 7.6729 | 16.0783 | 15.9614   |
-| 3.1651        | 6.0   | 7254 | 3.0283          | 16.5121 | 7.9439 | 16.1227 | 16.0756   |
-| 3.1387        | 7.0   | 8463 | 3.0188          | 16.6558 | 8.1024 | 16.3989 | 16.39     |
-| 3.1139        | 8.0   | 9672 | 3.0193          | 17.1639 | 8.3005 | 16.8294 | 16.8084   |
+| 6.9675        | 1.0   | 1209 | 3.2986          | 15.3583 | 6.7842 | 14.8665 | 14.8268   |
+| 3.8997        | 2.0   | 2418 | 3.1665          | 16.4507 | 7.5059 | 15.7996 | 15.861    |
+| 3.5826        | 3.0   | 3627 | 3.1106          | 17.1966 | 8.2927 | 16.6054 | 16.4505   |
+| 3.421         | 4.0   | 4836 | 3.0963          | 17.3181 | 8.7401 | 16.8773 | 16.8044   |
+| 3.3089        | 5.0   | 6045 | 3.0490          | 16.7047 | 7.5184 | 16.2967 | 16.1796   |
+| 3.2437        | 6.0   | 7254 | 3.0401          | 16.6837 | 7.8027 | 16.0671 | 16.0037   |
+| 3.2133        | 7.0   | 8463 | 3.0292          | 16.3691 | 7.5515 | 16.012  | 15.9647   |
+| 3.1851        | 8.0   | 9672 | 3.0303          | 16.6072 | 7.5336 | 16.1402 | 16.1129   |
 ### Framework versions
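
To make the README change easier to read at a glance, the final-epoch evaluation metrics of the two runs can be compared directly. The snippet below is only a minimal sketch: the numbers are copied from the diff above, while the dictionary keys and the `metric_deltas` helper are hypothetical names chosen for illustration and are not part of the repository.

```python
# Final-epoch (epoch 8.0, step 9672) evaluation metrics copied from the README diff.
# The key names are illustrative; the README labels them Loss, Rouge1, Rouge2, Rougel, Rougelsum.
previous = {"loss": 3.0193, "rouge1": 17.1639, "rouge2": 8.3005,
            "rougeL": 16.8294, "rougeLsum": 16.8084}
current = {"loss": 3.0303, "rouge1": 16.6072, "rouge2": 7.5336,
           "rougeL": 16.1402, "rougeLsum": 16.1129}


def metric_deltas(old, new):
    """Return new - old (rounded to 4 decimals) for every metric present in both runs."""
    return {name: round(new[name] - old[name], 4) for name in old if name in new}


deltas = metric_deltas(previous, current)
for name, delta in deltas.items():
    print(f"{name:10s} {previous[name]:8.4f} -> {current[name]:8.4f} ({delta:+.4f})")
```

With these figures the validation loss rises slightly and all four ROUGE scores drop, so the retrained checkpoint does not improve on the previous one on this evaluation set.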

generation_config.json CHANGED

@@ -1,5 +1,4 @@
 {
-  "_from_model_config": true,
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,

runs/Dec03_22-51-16_475927a3c946/events.out.tfevents.1733266302.475927a3c946.1019.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6c408b1dcb922061ec6263ab91025078ce4b6b93892d7ece465cf75852060bee
-size 10395
+oid sha256:f01ccdd37ea28f6c72202f3c3591b585cefa0b1a71c0eb094e1ff5f7f8a44521
+size 11223

runs/Dec03_22-51-16_475927a3c946/events.out.tfevents.1733270187.475927a3c946.1019.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:59d37cf3965700c45b905add81c7c068fe6c805793cab3539e8cb80d538b0887
+size 562
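
The TensorBoard event files above are stored as Git LFS pointers: three lines giving the spec version, the SHA-256 of the real file content, and its size in bytes. As a rough sketch of how such a pointer relates to the underlying file, the snippet below rebuilds one from raw bytes using only the standard library. The `lfs_pointer` function name is hypothetical, and the sample input is a placeholder, not the actual event file.

```python
import hashlib


def lfs_pointer(data: bytes) -> str:
    """Build the text of a Git LFS v1 pointer for the given file content."""
    digest = hashlib.sha256(data).hexdigest()  # the oid is the SHA-256 of the content
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{digest}\n"
        f"size {len(data)}\n"                  # size is the content length in bytes
    )


# Placeholder content; reading a real event file in binary mode works the same way.
print(lfs_pointer(b"example event data"), end="")
```

Running this over a downloaded event file and comparing the result with the pointer in the diff is one way to check that the download matches what was committed.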