Commit 4f7ff9d, committed by sauc-abadal-lloret
1 parent: de0b71b

Training complete

README.md CHANGED
@@ -19,11 +19,12 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: nan
-- Rouge1: 0.0
-- Rouge2: 0.0
-- Rougel: 0.0
-- Rougelsum: 0.0
+- Loss: 3.3696
+- Model Preparation Time: 0.0085
+- Rouge1: 19.7992
+- Rouge2: 5.6898
+- Rougel: 16.7802
+- Rougelsum: 16.7497
 
 ## Model description
 
@@ -42,14 +43,13 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 5.6e-05
-- train_batch_size: 16
-- eval_batch_size: 16
+- learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 4
-- mixed_precision_training: Native AMP
 
 ### Training results
 
@@ -59,5 +59,5 @@ The following hyperparameters were used during training:
 
 - Transformers 4.44.2
 - Pytorch 2.4.0+cu121
-- Datasets 2.21.0
+- Datasets 3.0.0
 - Tokenizers 0.19.1
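
Note on the hyperparameters: the updated card lists `learning_rate: 2e-05` with `lr_scheduler_type: linear`. The sketch below shows roughly what a linear schedule computes. It is an illustration only, not the training code from this commit: the card does not list warmup steps, so zero warmup is assumed, and `total_steps=1000` is a hypothetical value because the dataset size is not given.

```python
def linear_lr(step: int, total_steps: int,
              base_lr: float = 2e-05, warmup_steps: int = 0) -> float:
    """Learning rate at `step` for a linear warmup/decay schedule.

    base_lr comes from the model card; warmup_steps=0 is an assumption.
    """
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Linear decay from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Hypothetical run length; the real step count is not in the card.
total_steps = 1000
schedule = [linear_lr(s, total_steps) for s in (0, 250, 500, 750, 1000)]
print(schedule)
```

Under these assumptions the rate starts at the listed value and falls linearly to zero at the last step.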
runs/Sep17_07-22-14_817911b27cd9/events.out.tfevents.1726560360.817911b27cd9.490.2 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fe05177f314ad86c98371dd658177829741d0f901c25ed34ccc3ce082e152d00
+size 628
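
Note on the added file: the three lines above are a Git LFS pointer, not the TensorBoard event file itself. The pointer records the spec version, the object's hash, and the size in bytes of the real file. A minimal sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of Git LFS or any Hugging Face library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is a key, one space, then the value.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields.get("oid", "").partition(":")
    return {
        "version": fields.get("version"),
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:fe05177f314ad86c98371dd658177829741d0f901c25ed34ccc3ce082e152d00
size 628
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])
```

For the pointer in this commit, the parsed size is 628 bytes and the digest is a SHA-256 hash.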