Gan1108 commited on
Commit
e681253
1 Parent(s): 2dbb93a

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [malteos/gpt2-uk](https://huggingface.co/malteos/gpt2-uk) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.6971
19
 
20
  ## Model description
21
 
@@ -40,20 +40,25 @@ The following hyperparameters were used during training:
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - num_epochs: 3.0
44
 
45
  ### Training results
46
 
47
- | Training Loss | Epoch | Step | Validation Loss |
48
- |:-------------:|:-----:|:----:|:---------------:|
49
- | 0.8803 | 1.0 | 1730 | 0.7869 |
50
- | 0.7719 | 2.0 | 3460 | 0.7181 |
51
- | 0.7025 | 3.0 | 5190 | 0.6971 |
 
 
 
 
 
52
 
53
 
54
  ### Framework versions
55
 
56
  - Transformers 4.42.4
57
  - Pytorch 2.3.1+cu121
58
- - Datasets 2.20.0
59
  - Tokenizers 0.19.1
 
15
 
16
  This model is a fine-tuned version of [malteos/gpt2-uk](https://huggingface.co/malteos/gpt2-uk) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.6176
19
 
20
  ## Model description
21
 
 
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
+ - num_epochs: 8
44
 
45
  ### Training results
46
 
47
+ | Training Loss | Epoch | Step | Validation Loss |
48
+ |:-------------:|:-----:|:-----:|:---------------:|
49
+ | 0.8695 | 1.0 | 1729 | 0.7865 |
50
+ | 0.7478 | 2.0 | 3458 | 0.7112 |
51
+ | 0.6687 | 3.0 | 5187 | 0.6721 |
52
+ | 0.6157 | 4.0 | 6916 | 0.6469 |
53
+ | 0.589 | 5.0 | 8645 | 0.6342 |
54
+ | 0.5588 | 6.0 | 10374 | 0.6239 |
55
+ | 0.538 | 7.0 | 12103 | 0.6187 |
56
+ | 0.5224 | 8.0 | 13832 | 0.6176 |
57
 
58
 
59
  ### Framework versions
60
 
61
  - Transformers 4.42.4
62
  - Pytorch 2.3.1+cu121
63
+ - Datasets 2.21.0
64
  - Tokenizers 0.19.1
runs/Aug14_08-32-58_dea37f91c97c/events.out.tfevents.1723624380.dea37f91c97c.166.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fcc745b83b4e9cfbd216de8dd3d95267e710feb468e4720a8a6805397c410ace
3
- size 12713
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3a4e79f85d82a22ac944735f9bf1b44abc3d89a9ac3da66638d3ca39355732c0
3
+ size 13338