sgimmel commited on
Commit
0cc518a
1 Parent(s): 47927a8

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [EleutherAI/gpt-neo-125m](https://huggingface.co/EleutherAI/gpt-neo-125m) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 4.5588
19
 
20
  ## Model description
21
 
@@ -40,15 +40,22 @@ The following hyperparameters were used during training:
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - num_epochs: 3.0
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | No log | 1.0 | 173 | 4.6374 |
50
- | No log | 2.0 | 346 | 4.5730 |
51
- | 4.6126 | 3.0 | 519 | 4.5588 |
 
 
 
 
 
 
 
52
 
53
 
54
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [EleutherAI/gpt-neo-125m](https://huggingface.co/EleutherAI/gpt-neo-125m) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 4.5221
19
 
20
  ## Model description
21
 
 
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
+ - num_epochs: 10
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | No log | 1.0 | 192 | 4.6578 |
50
+ | No log | 2.0 | 384 | 4.5784 |
51
+ | 4.5695 | 3.0 | 576 | 4.5419 |
52
+ | 4.5695 | 4.0 | 768 | 4.5219 |
53
+ | 4.5695 | 5.0 | 960 | 4.5098 |
54
+ | 4.1799 | 6.0 | 1152 | 4.5060 |
55
+ | 4.1799 | 7.0 | 1344 | 4.5073 |
56
+ | 3.9822 | 8.0 | 1536 | 4.5148 |
57
+ | 3.9822 | 9.0 | 1728 | 4.5200 |
58
+ | 3.9822 | 10.0 | 1920 | 4.5221 |
59
 
60
 
61
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5a009379f8178c340d0a6e70ad7a9e1f1bf812b2d9d132ed884df051ef6376f4
3
  size 500811336
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2f1bb46a2e269754ef14e0a59a9cfd9f340654511cb738f18181eb4afa626ff7
3
  size 500811336
runs/Nov21_20-27-31_01595a07d75a/events.out.tfevents.1700598461.01595a07d75a.47.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:33b7fbfc8551e5d5d993100edc4ca04321521cda746e2a1f5d8669054319595a
3
- size 7070
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:43b6a060d625c898e8727bcf124ffd440929e5dff73bc670923649faa9a68e2d
3
+ size 8237
runs/Nov21_20-27-31_01595a07d75a/events.out.tfevents.1700598909.01595a07d75a.47.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ebca2c65adac8fbe214c1e742d7ccdda2f221d69ff613d2d0fac6b1a9d05c2b7
3
+ size 359