nanom
/

gtp_adaptation_martin_fierro_v2

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

nanom commited on Sep 12, 2023

Commit

4742ee7

•

1 Parent(s): 7c37dfc

End of training

Files changed (3) hide show

README.md +9 -13
pytorch_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [DeepESP/gpt2-spanish](https://huggingface.co/DeepESP/gpt2-spanish) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.5701
 ## Model description
@@ -35,26 +35,22 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 9
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.9511        | 1.0   | 10   | 2.8409          |
-| 2.7597        | 2.0   | 20   | 2.7089          |
-| 2.6189        | 3.0   | 30   | 2.6518          |
-| 2.5426        | 4.0   | 40   | 2.6172          |
-| 2.4481        | 5.0   | 50   | 2.5969          |
-| 2.4149        | 6.0   | 60   | 2.5836          |
-| 2.3332        | 7.0   | 70   | 2.5756          |
-| 2.2836        | 8.0   | 80   | 2.5715          |
-| 2.2832        | 9.0   | 90   | 2.5701          |
 ### Framework versions

 This model is a fine-tuned version of [DeepESP/gpt2-spanish](https://huggingface.co/DeepESP/gpt2-spanish) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.5855
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 2.8153        | 1.0   | 20   | 2.7398          |
+| 2.6191        | 2.0   | 40   | 2.6462          |
+| 2.5068        | 3.0   | 60   | 2.6071          |
+| 2.453         | 4.0   | 80   | 2.5908          |
+| 2.3603        | 5.0   | 100  | 2.5855          |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a61029606936c213650c6a2007f51451344b9c4d742bcb8c3ad71b4dffd92d06
 size 497807197

 version https://git-lfs.github.com/spec/v1
+oid sha256:05cbe34b0917217e7d3897e8e3dc1eeb850ef6cabcd9860adf9fb762bc1bfb8e
 size 497807197

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1a25e59246f25a99d9fae71aba690b69a6b9ceb6fd5056f994eb76a3bbcc01de
 size 4027

 version https://git-lfs.github.com/spec/v1
+oid sha256:229de42a62ee24b85beed3b30d649c2c7e1f17ebd62d3474456e243d94e32e9a
 size 4027