shubhambhawsar
/

mt5-small-finetuned-en-to-hi

Text2Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

shubhambhawsar commited on Jan 10

Commit

30e3f46

•

1 Parent(s): f90757d

End of training

Files changed (2) hide show

README.md +9 -13
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -3,8 +3,6 @@ license: apache-2.0
 base_model: google/mt5-small
 tags:
 - generated_from_trainer
-metrics:
-- bleu
 model-index:
 - name: mt5-small-finetuned-en-to-hi
   results: []
@@ -17,9 +15,14 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.1833
-- Bleu: 0.8678
-- Gen Len: 4.7065
 ## Model description
@@ -44,16 +47,9 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 - mixed_precision_training: Native AMP
-### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Bleu   | Gen Len |
-|:-------------:|:-----:|:-----:|:---------------:|:------:|:-------:|
-| 3.8437        | 1.0   | 20322 | 3.1833          | 0.8678 | 4.7065  |
 ### Framework versions
 - Transformers 4.36.2

 base_model: google/mt5-small
 tags:
 - generated_from_trainer
 model-index:
 - name: mt5-small-finetuned-en-to-hi
   results: []
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- eval_loss: 5.5348
+- eval_bleu: 0.0
+- eval_gen_len: 2.0828
+- eval_runtime: 96.4745
+- eval_samples_per_second: 68.091
+- eval_steps_per_second: 2.135
+- epoch: 0.05
+- step: 1000
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Framework versions
 - Transformers 4.36.2

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f26a5659c4084526d2e0aaf4eb523d629e80ad5aa78569242fbd34331115df83
 size 1200729512

 version https://git-lfs.github.com/spec/v1
+oid sha256:b0b85f79a74b52165f69cc6b33dbcec18e2f9aa4f89729452be57b09af1b349f
 size 1200729512