phitime
/

flan-t5-small-finetuned-mlsum-tr

Text2Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

phitime commited on Jan 21

Commit

c0e6080

•

1 Parent(s): 63ecb42

End of training

Files changed (1) hide show

README.md +12 -16

README.md CHANGED Viewed

@@ -3,8 +3,6 @@ license: apache-2.0
 base_model: google/flan-t5-small
 tags:
 - generated_from_trainer
-metrics:
-- rouge
 model-index:
 - name: flan-t5-small-finetuned-mlsum-tr
   results: []
@@ -17,12 +15,17 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: nan
-- Rouge1: 4.1461
-- Rouge2: 1.3002
-- Rougel: 3.4895
-- Rougelsum: 3.663
-- Gen Len: 14.1553
 ## Model description
@@ -47,16 +50,9 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 - mixed_precision_training: Native AMP
-### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
-|:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| 0.0           | 1.0   | 15580 | nan             | 4.1461 | 1.3002 | 3.4895 | 3.663     | 14.1553 |
 ### Framework versions
 - Transformers 4.35.2

 base_model: google/flan-t5-small
 tags:
 - generated_from_trainer
 model-index:
 - name: flan-t5-small-finetuned-mlsum-tr
   results: []
 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- eval_loss: nan
+- eval_rouge1: 10.3443
+- eval_rouge2: 5.3615
+- eval_rougeL: 8.9871
+- eval_rougeLsum: 9.3134
+- eval_gen_len: 16.1858
+- eval_runtime: 378.4208
+- eval_samples_per_second: 33.759
+- eval_steps_per_second: 2.111
+- epoch: 2.0
+- step: 31160
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Framework versions
 - Transformers 4.35.2