nanom commited on
Commit
57e949f
1 Parent(s): c8d6f36

End of training

Browse files
Files changed (4) hide show
  1. README.md +7 -7
  2. pytorch_model.bin +1 -1
  3. tokenizer.json +2 -2
  4. training_args.bin +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [DeepESP/gpt2-spanish](https://huggingface.co/DeepESP/gpt2-spanish) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 2.5287
19
 
20
  ## Model description
21
 
@@ -34,7 +34,7 @@ More information needed
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
- - learning_rate: 2.5e-05
38
  - train_batch_size: 8
39
  - eval_batch_size: 8
40
  - seed: 42
@@ -46,11 +46,11 @@ The following hyperparameters were used during training:
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | 2.8091 | 1.0 | 40 | 2.6299 |
50
- | 2.3812 | 2.0 | 80 | 2.5587 |
51
- | 2.2525 | 3.0 | 120 | 2.5377 |
52
- | 2.1789 | 4.0 | 160 | 2.5308 |
53
- | 2.1258 | 5.0 | 200 | 2.5287 |
54
 
55
 
56
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [DeepESP/gpt2-spanish](https://huggingface.co/DeepESP/gpt2-spanish) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 4.7721
19
 
20
  ## Model description
21
 
 
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
+ - learning_rate: 1.5e-05
38
  - train_batch_size: 8
39
  - eval_batch_size: 8
40
  - seed: 42
 
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | 5.0919 | 1.0 | 40 | 4.9569 |
50
+ | 4.7032 | 2.0 | 80 | 4.8517 |
51
+ | 4.4604 | 3.0 | 120 | 4.8015 |
52
+ | 4.2456 | 4.0 | 160 | 4.7786 |
53
+ | 4.2514 | 5.0 | 200 | 4.7721 |
54
 
55
 
56
  ### Framework versions
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:de3ed75972c1b83582cca78a41b1b5c6412f6536da7097726dde53d38479b5d2
3
  size 497807197
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:68ebb5dfe2362a8175f811baf80ca8ddfd74630043b3b0a558fe18331402a6b9
3
  size 497807197
tokenizer.json CHANGED
@@ -2,13 +2,13 @@
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
- "max_length": 70,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
9
  "padding": {
10
  "strategy": {
11
- "Fixed": 70
12
  },
13
  "direction": "Right",
14
  "pad_to_multiple_of": null,
 
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
+ "max_length": 33,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
9
  "padding": {
10
  "strategy": {
11
+ "Fixed": 33
12
  },
13
  "direction": "Right",
14
  "pad_to_multiple_of": null,
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7317fae08b9be26bc6a609041b83209a0eb4b7a04684ca15fb19a6b0671a9028
3
  size 4027
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:adebedf23d7f801f7ba85275e0ee1427f3495306840468a84c32ca77058bdd14
3
  size 4027