ysasaki6023/results

Generated from Trainer


ysasaki6023 committed on Apr 14, 2023

Commit 999d746 • 1 Parent(s): 147fc05

update model card README.md

Files changed (1)

README.md +3 -5


@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the rotten_tomatoes dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.2147
 ## Model description
@@ -39,18 +39,16 @@ The following hyperparameters were used during training:
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 42
-- gradient_accumulation_steps: 128
-- total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- training_steps: 2
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 4.8988        | 1.0   | 2    | 5.2147          |
 ### Framework versions

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the rotten_tomatoes dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.2143
 ## Model description
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- training_steps: 16
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 4.8863        | 1.0   | 16   | 5.2143          |
 ### Framework versions
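
Note on the hyperparameter change in this diff: the old card listed `gradient_accumulation_steps: 128` and `total_train_batch_size: 128` alongside `train_batch_size: 1`, while the new card lists only `train_batch_size: 1`. The relationship between these values can be sketched as below. This is a minimal illustration, not code from the repository: the helper name `effective_batch_size` is hypothetical, and a single device and a default accumulation of 1 for the new run are assumptions, since the card states neither.

```python
def effective_batch_size(per_device_batch_size: int,
                         gradient_accumulation_steps: int = 1,
                         num_devices: int = 1) -> int:
    """Number of samples contributing to one optimizer step.

    Hypothetical helper for illustration; num_devices=1 is an assumption,
    because the model card does not state how many devices were used.
    """
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Old card: train_batch_size 1 with 128 accumulation steps.
old_total = effective_batch_size(1, gradient_accumulation_steps=128)

# New card: train_batch_size 1, no accumulation listed (assumed to be 1).
new_total = effective_batch_size(1)

print(old_total, new_total)  # 128 1
```

Under these assumptions the old value reproduces the removed `total_train_batch_size: 128` line, and the new run updates the weights after every single sample.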