oSabre commited on
Commit
ee64a3c
1 Parent(s): c2e4a7a

End of training

Browse files
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: apache-2.0
3
- base_model: t5-small
4
  tags:
5
  - generated_from_trainer
6
  datasets:
@@ -16,13 +16,13 @@ model-index:
16
  dataset:
17
  name: opus_books
18
  type: opus_books
19
- config: en-pt
20
  split: train
21
- args: en-pt
22
  metrics:
23
  - name: Bleu
24
  type: bleu
25
- value: 0.3989
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,11 +30,11 @@ should probably proofread and complete it, then remove this comment. -->
30
 
31
  # opus_books_es_pt
32
 
33
- This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the opus_books dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 3.3303
36
- - Bleu: 0.3989
37
- - Gen Len: 17.5302
38
 
39
  ## Model description
40
 
@@ -54,8 +54,8 @@ More information needed
54
 
55
  The following hyperparameters were used during training:
56
  - learning_rate: 2e-05
57
- - train_batch_size: 16
58
- - eval_batch_size: 16
59
  - seed: 42
60
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
61
  - lr_scheduler_type: linear
@@ -66,16 +66,16 @@ The following hyperparameters were used during training:
66
 
67
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
68
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
69
- | No log | 1.0 | 71 | 3.7759 | 0.5559 | 16.9715 |
70
- | No log | 2.0 | 142 | 3.5343 | 0.517 | 17.2776 |
71
- | No log | 3.0 | 213 | 3.4102 | 0.4355 | 17.4448 |
72
- | No log | 4.0 | 284 | 3.3491 | 0.4057 | 17.516 |
73
- | No log | 5.0 | 355 | 3.3303 | 0.3989 | 17.5302 |
74
 
75
 
76
  ### Framework versions
77
 
78
  - Transformers 4.36.1
79
- - Pytorch 2.1.0+cu121
80
- - Datasets 2.15.0
81
  - Tokenizers 0.15.0
 
1
  ---
2
  license: apache-2.0
3
+ base_model: t5-base
4
  tags:
5
  - generated_from_trainer
6
  datasets:
 
16
  dataset:
17
  name: opus_books
18
  type: opus_books
19
+ config: es-pt
20
  split: train
21
+ args: es-pt
22
  metrics:
23
  - name: Bleu
24
  type: bleu
25
+ value: 0.8634
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
30
 
31
  # opus_books_es_pt
32
 
33
+ This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on the opus_books dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 2.3338
36
+ - Bleu: 0.8634
37
+ - Gen Len: 18.4624
38
 
39
  ## Model description
40
 
 
54
 
55
  The following hyperparameters were used during training:
56
  - learning_rate: 2e-05
57
+ - train_batch_size: 8
58
+ - eval_batch_size: 8
59
  - seed: 42
60
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
61
  - lr_scheduler_type: linear
 
66
 
67
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
68
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
69
+ | No log | 1.0 | 133 | 2.5425 | 0.5463 | 18.5526 |
70
+ | No log | 2.0 | 266 | 2.4200 | 0.7483 | 18.515 |
71
+ | No log | 3.0 | 399 | 2.3680 | 0.7772 | 18.5226 |
72
+ | 2.705 | 4.0 | 532 | 2.3432 | 0.8192 | 18.4962 |
73
+ | 2.705 | 5.0 | 665 | 2.3338 | 0.8634 | 18.4624 |
74
 
75
 
76
  ### Framework versions
77
 
78
  - Transformers 4.36.1
79
+ - Pytorch 2.0.0
80
+ - Datasets 2.1.0
81
  - Tokenizers 0.15.0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:52d19bbb208486b780016794d3702f05c8763ab3105dbbb7c1e737aa19184555
3
  size 891644712
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:044eee3e57364eb7b029f74c1914596642eecdcb1bd00b85b77c5cd2d2f2d85c
3
  size 891644712
runs/Dec17_14-29-40_222b4dc5c326/events.out.tfevents.1702823381.222b4dc5c326.42.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7d73fc141f5543be713240a0a190d623171cb3da290e9ffc6f0e47299306c033
3
- size 6634
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a917ad8ac542c1c3dca75e5d128d185e2d5791b8020c1f4e1f3e59ec8d6d23f9
3
+ size 7728