tobijen/distilgpt2_right_headings

tobijen committed on Jul 20, 2023

Commit 9bce583 • 1 Parent(s): 819e4c4

Training in progress epoch 0

Files changed (2)

README.md +4 -9
tf_model.h5 +1 -1

README.md CHANGED

@@ -15,9 +15,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 3.0427
-- Validation Loss: 6.8548
-- Epoch: 5
+- Train Loss: 2.8208
+- Validation Loss: 6.9815
+- Epoch: 0
 ## Model description
@@ -43,12 +43,7 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 4.3876     | 5.9195          | 0     |
-| 4.0392     | 6.0862          | 1     |
-| 3.7551     | 6.2602          | 2     |
-| 3.4935     | 6.4659          | 3     |
-| 3.2463     | 6.7317          | 4     |
-| 3.0427     | 6.8548          | 5     |
+| 2.8208     | 6.9815          | 0     |
 ### Framework versions

tf_model.h5 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f09acaa346756ce7fd5bc256bdd0fb22f675abf0bbeb6ea9a3d41ff65d10b3e6
 size 327745472
+oid sha256:a6b7b26b8022f0a43373d584e4dd34c9a07b516189dd6541946c918f1df8d2e2
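
The tf_model.h5 entry above is a Git LFS pointer, not the weights themselves: it records a spec version, a SHA-256 object ID, and a byte size. As a rough sketch, a downloaded copy of the file could be checked against the new pointer along these lines. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Hugging Face or Git LFS tooling, and the local file path is an assumption.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the "key value" lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest named by the pointer."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so a ~328 MB weights file is not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents after this commit, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:a6b7b26b8022f0a43373d584e4dd34c9a07b516189dd6541946c918f1df8d2e2
size 327745472
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

Calling `matches_pointer("tf_model.h5", pointer)` on a local download (path assumed) would then report whether the file corresponds to this commit. The size is unchanged between the two pointers, so only the digest distinguishes the old weights from the new ones.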