End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [EleutherAI/gpt-neo-125m](https://huggingface.co/EleutherAI/gpt-neo-125m) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.5588
 ## Model description
@@ -40,15 +40,22 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3.0
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 173  | 4.6374          |
-| No log        | 2.0   | 346  | 4.5730          |
-| 4.6126        | 3.0   | 519  | 4.5588          |
 ### Framework versions

 This model is a fine-tuned version of [EleutherAI/gpt-neo-125m](https://huggingface.co/EleutherAI/gpt-neo-125m) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.5221
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 192  | 4.6578          |
+| No log        | 2.0   | 384  | 4.5784          |
+| 4.5695        | 3.0   | 576  | 4.5419          |
+| 4.5695        | 4.0   | 768  | 4.5219          |
+| 4.5695        | 5.0   | 960  | 4.5098          |
+| 4.1799        | 6.0   | 1152 | 4.5060          |
+| 4.1799        | 7.0   | 1344 | 4.5073          |
+| 3.9822        | 8.0   | 1536 | 4.5148          |
+| 3.9822        | 9.0   | 1728 | 4.5200          |
+| 3.9822        | 10.0  | 1920 | 4.5221          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5a009379f8178c340d0a6e70ad7a9e1f1bf812b2d9d132ed884df051ef6376f4
 size 500811336

 version https://git-lfs.github.com/spec/v1
+oid sha256:2f1bb46a2e269754ef14e0a59a9cfd9f340654511cb738f18181eb4afa626ff7
 size 500811336

runs/Nov21_20-27-31_01595a07d75a/events.out.tfevents.1700598461.01595a07d75a.47.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:33b7fbfc8551e5d5d993100edc4ca04321521cda746e2a1f5d8669054319595a
-size 7070

 version https://git-lfs.github.com/spec/v1
+oid sha256:43b6a060d625c898e8727bcf124ffd440929e5dff73bc670923649faa9a68e2d
+size 8237

runs/Nov21_20-27-31_01595a07d75a/events.out.tfevents.1700598909.01595a07d75a.47.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:ebca2c65adac8fbe214c1e742d7ccdda2f221d69ff613d2d0fac6b1a9d05c2b7
+size 359