nanom/gtp_adaptation_martin_fierro_v2


nanom committed on Sep 12, 2023

Commit 57e949f • 1 Parent(s): c8d6f36

End of training

Files changed (4)

README.md +7 -7
pytorch_model.bin +1 -1
tokenizer.json +2 -2
training_args.bin +1 -1

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [DeepESP/gpt2-spanish](https://huggingface.co/DeepESP/gpt2-spanish) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.5287
 ## Model description
@@ -34,7 +34,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2.5e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -46,11 +46,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.8091        | 1.0   | 40   | 2.6299          |
-| 2.3812        | 2.0   | 80   | 2.5587          |
-| 2.2525        | 3.0   | 120  | 2.5377          |
-| 2.1789        | 4.0   | 160  | 2.5308          |
-| 2.1258        | 5.0   | 200  | 2.5287          |
 ### Framework versions

 This model is a fine-tuned version of [DeepESP/gpt2-spanish](https://huggingface.co/DeepESP/gpt2-spanish) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.7721
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1.5e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 5.0919        | 1.0   | 40   | 4.9569          |
+| 4.7032        | 2.0   | 80   | 4.8517          |
+| 4.4604        | 3.0   | 120  | 4.8015          |
+| 4.2456        | 4.0   | 160  | 4.7786          |
+| 4.2514        | 5.0   | 200  | 4.7721          |
 ### Framework versions
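
Reviewer note: the validation losses in the two tables are easier to compare as perplexities. The sketch below assumes the reported loss is a mean per-token cross-entropy in nats, which the README does not state explicitly; `perplexity` is an illustrative helper, not part of the commit. The tokenizer's `max_length` also changes in this commit and the dataset is listed as unknown, so the two runs may not be directly comparable.

```python
import math

# Validation losses copied from the old (-) and new (+) README tables.
old_val_loss = [2.6299, 2.5587, 2.5377, 2.5308, 2.5287]
new_val_loss = [4.9569, 4.8517, 4.8015, 4.7786, 4.7721]


def perplexity(loss: float) -> float:
    """Perplexity for a mean per-token cross-entropy (assumed to be in nats)."""
    return math.exp(loss)


for epoch, (old, new) in enumerate(zip(old_val_loss, new_val_loss), start=1):
    print(f"epoch {epoch}: old ppl = {perplexity(old):.1f}, new ppl = {perplexity(new):.1f}")
```

Under that assumption, both runs improve monotonically on the validation set across the five epochs, although the new run's final training loss ticks up slightly (4.2456 to 4.2514).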

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:de3ed75972c1b83582cca78a41b1b5c6412f6536da7097726dde53d38479b5d2
 size 497807197

 version https://git-lfs.github.com/spec/v1
+oid sha256:68ebb5dfe2362a8175f811baf80ca8ddfd74630043b3b0a558fe18331402a6b9
 size 497807197
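
Reviewer note: `pytorch_model.bin` is stored as a Git LFS pointer, so only the `oid` changes here while the size stays at 497807197 bytes. A minimal sketch of how a downloaded file could be checked against such a pointer follows; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and a real check on a ~500 MB file would hash it in chunks rather than load it into memory.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """True if the byte length and digest of `data` agree with the pointer."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# New-side pointer for pytorch_model.bin, copied from the diff above.
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:68ebb5dfe2362a8175f811baf80ca8ddfd74630043b3b0a558fe18331402a6b9\n"
    "size 497807197\n"
)
print(new_pointer["algo"], new_pointer["size"])
```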

tokenizer.json CHANGED

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 70,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 70
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 33,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 33
     },
     "direction": "Right",
     "pad_to_multiple_of": null,
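
Reviewer note: with truncation `max_length` and padding `Fixed` both moving from 70 to 33, every encoded sequence comes out exactly 33 tokens long. The pure-Python sketch below illustrates that behaviour for right-side truncation and right-side padding. It is not the `tokenizers` implementation; `truncate_and_pad` is a hypothetical helper, and the pad id of 0 is an assumption because the pad token is not visible in this hunk.

```python
def truncate_and_pad(ids, max_length, pad_id=0):
    """Right-truncate to max_length, then right-pad to exactly max_length.

    pad_id=0 is a placeholder; the real pad id is not shown in the diff.
    Returns the fixed-length ids and the matching attention mask.
    """
    kept = list(ids[:max_length])          # drop tokens from the right end
    n_pad = max_length - len(kept)
    padded = kept + [pad_id] * n_pad       # pad on the right
    attention_mask = [1] * len(kept) + [0] * n_pad
    return padded, attention_mask


long_ids = list(range(1, 41))              # 40 made-up token ids
short_ids = [5, 6, 7]

for ids in (long_ids, short_ids):
    padded, mask = truncate_and_pad(ids, max_length=33)
    print(len(padded), sum(mask))
```

One consequence worth checking against the training data: any line longer than 33 tokens now loses its tail, where previously up to 70 tokens were kept.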

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7317fae08b9be26bc6a609041b83209a0eb4b7a04684ca15fb19a6b0671a9028
 size 4027

 version https://git-lfs.github.com/spec/v1
+oid sha256:adebedf23d7f801f7ba85275e0ee1427f3495306840468a84c32ca77058bdd14
 size 4027