Vexemous commited on
Commit
180ec89
1 Parent(s): a16f44f

End of training

Browse files
README.md CHANGED
@@ -14,6 +14,8 @@ should probably proofread and complete it, then remove this comment. -->
14
  # distilgpt2-finetuned-general-stories
15
 
16
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 
 
17
 
18
  ## Model description
19
 
@@ -41,6 +43,22 @@ The following hyperparameters were used during training:
41
  - num_epochs: 10
42
  - mixed_precision_training: Native AMP
43
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
44
  ### Framework versions
45
 
46
  - Transformers 4.40.0
 
14
  # distilgpt2-finetuned-general-stories
15
 
16
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
17
+ It achieves the following results on the evaluation set:
18
+ - Loss: 4.0063
19
 
20
  ## Model description
21
 
 
43
  - num_epochs: 10
44
  - mixed_precision_training: Native AMP
45
 
46
+ ### Training results
47
+
48
+ | Training Loss | Epoch | Step | Validation Loss |
49
+ |:-------------:|:-----:|:-----:|:---------------:|
50
+ | 5.7415 | 1.0 | 2940 | 5.4942 |
51
+ | 5.1216 | 2.0 | 5880 | 4.9520 |
52
+ | 4.7464 | 3.0 | 8820 | 4.6264 |
53
+ | 4.5081 | 4.0 | 11760 | 4.4148 |
54
+ | 4.3457 | 5.0 | 14700 | 4.2677 |
55
+ | 4.2207 | 6.0 | 17640 | 4.1709 |
56
+ | 4.1182 | 7.0 | 20580 | 4.0961 |
57
+ | 4.0599 | 8.0 | 23520 | 4.0473 |
58
+ | 4.0235 | 9.0 | 26460 | 4.0176 |
59
+ | 3.9931 | 10.0 | 29400 | 4.0063 |
60
+
61
+
62
  ### Framework versions
63
 
64
  - Transformers 4.40.0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6dc022464bbb011d8f63f8b7e3ad7aaf7cf2fee228f870a60465cb6588988621
3
  size 327657928
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:70cb14c6404b8e75f6dcde3d56e9d9a1f141eb5566c91cfeab1a7fbb7ff3c97f
3
  size 327657928
runs/Apr20_03-59-17_instance-20240420-041200/events.out.tfevents.1713585558.instance-20240420-041200 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:640a2b50d90de8eec06221e4c09bd65b517f26821878aa1f50cead9f3a19e57a
3
- size 19611
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b8e07a484b8ba63fce67e87a023c04c51ef1aa8b885f78718c65b126c69e451c
3
+ size 20462