stojchet
/

jkto10k1-jsft8

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

stojchet commited on Jul 18

Commit

f3b02de

•

1 Parent(s): 80454c3

End of training

Files changed (1) hide show

README.md +10 -0

README.md CHANGED Viewed

@@ -15,9 +15,12 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 # jkto10k1-jsft8
 This model is a fine-tuned version of [stojchet/jkto10k1](https://huggingface.co/stojchet/jkto10k1) on the generator dataset.
 ## Model description
@@ -47,6 +50,13 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 200
 - num_epochs: 3
 ### Framework versions
 - Transformers 4.43.0.dev0

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/stojchets/huggingface/runs/jkto10k1-jsft8)
 # jkto10k1-jsft8
 This model is a fine-tuned version of [stojchet/jkto10k1](https://huggingface.co/stojchet/jkto10k1) on the generator dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.1946
 ## Model description
 - lr_scheduler_warmup_steps: 200
 - num_epochs: 3
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 1.0631        | 2.56  | 100  | 1.1946          |
 ### Framework versions
 - Transformers 4.43.0.dev0