Mamadou2727/nllb-fr-dje

Text2Text Generation

Generated from Trainer

Inference Endpoints

Mamadou2727 committed on Dec 6, 2023

Commit 68ffb01 • 1 Parent(s): e0fa7af

End of training

Files changed (3)

README.md +11 -7
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4331
-- Bleu: 27.0215
+- Loss: 1.2285
+- Bleu: 29.6795
 ## Model description
@@ -45,17 +45,21 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 4
+- num_epochs: 8
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu    |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
-| 2.8664        | 1.0   | 508  | 1.8497          | 20.6335 |
-| 1.7995        | 2.0   | 1017 | 1.5528          | 25.3333 |
-| 1.5732        | 3.0   | 1526 | 1.4589          | 26.7125 |
-| 1.4864        | 3.99  | 2032 | 1.4331          | 27.0215 |
+| 2.8423        | 1.0   | 508  | 1.8073          | 20.0301 |
+| 1.7444        | 2.0   | 1017 | 1.4882          | 25.3031 |
+| 1.4902        | 3.0   | 1526 | 1.3685          | 27.3325 |
+| 1.3532        | 4.0   | 2035 | 1.3004          | 28.0924 |
+| 1.2749        | 5.0   | 2544 | 1.2632          | 29.0171 |
+| 1.2153        | 6.0   | 3053 | 1.2432          | 29.3495 |
+| 1.1793        | 7.0   | 3562 | 1.2323          | 29.5367 |
+| 1.1548        | 7.99  | 4064 | 1.2285          | 29.6795 |
 ### Framework versions
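
To make the effect of doubling `num_epochs` easier to see, the following minimal sketch computes the BLEU gain between consecutive evaluations of the 8-epoch run and compares its final metrics with those of the earlier 4-epoch run. The numbers are copied from the diff above; the names `results`, `bleu_gains`, `previous_final_bleu`, and `previous_final_loss` are illustrative and do not come from the repository.

```python
# Rows copied from the updated "Training results" table:
# (epoch, step, validation loss, BLEU).
results = [
    (1.0, 508, 1.8073, 20.0301),
    (2.0, 1017, 1.4882, 25.3031),
    (3.0, 1526, 1.3685, 27.3325),
    (4.0, 2035, 1.3004, 28.0924),
    (5.0, 2544, 1.2632, 29.0171),
    (6.0, 3053, 1.2432, 29.3495),
    (7.0, 3562, 1.2323, 29.5367),
    (7.99, 4064, 1.2285, 29.6795),
]


def bleu_gains(rows):
    """Return the BLEU change between consecutive evaluations (4 decimals)."""
    return [round(later[3] - earlier[3], 4) for earlier, later in zip(rows, rows[1:])]


gains = bleu_gains(results)

# Final metrics of the previous 4-epoch run, taken from the removed diff lines.
previous_final_bleu = 27.0215
previous_final_loss = 1.4331

bleu_delta = round(results[-1][3] - previous_final_bleu, 4)
loss_delta = round(results[-1][2] - previous_final_loss, 4)

for (epoch, _step, _loss, _bleu), gain in zip(results[1:], gains):
    print(f"epoch {epoch}: BLEU gain {gain:+.4f}")
print(f"8-epoch vs 4-epoch run: BLEU {bleu_delta:+.4f}, validation loss {loss_delta:+.4f}")
```

Every evaluation improves on the previous one, but the last three gains are each below 0.35 BLEU, which suggests (without proving) that further epochs under the same settings would yield diminishing returns.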

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f97701f5faeda7c15997585f5d5aebeffaee01a219f71cbcb525ac8e9a52cbb7
+oid sha256:ad1be21cebdf6230f3cb57f676fde677f3d39ebe51fcadd6f5cf2091a36957f0
 size 2460354912

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f3ae9932bb7832478bd2d1921c353976ca606d9d426f49cd391ad87a4e598610
+oid sha256:06680ede8d19730a3c261c73cdcef4aff2ba6f296c78a6c12bc7f6b36c85fe8e
 size 4283
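
Both binary files are stored as Git LFS pointers, so the diff shows only a new `oid` (a SHA-256 digest of the file contents) with an unchanged `size`. As a hedged sketch, a downloaded file could be checked against its pointer as shown below. The helper names `parse_lfs_pointer` and `matches_pointer` and the constant `NEW_TRAINING_ARGS_POINTER` are hypothetical; only the pointer text itself comes from the diff.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file's size and SHA-256 match the LFS pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algorithm, _, expected_digest = fields["oid"].partition(":")
    if algorithm != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algorithm}")

    path = Path(path)
    # Cheap check first: a size mismatch means hashing is unnecessary.
    if path.stat().st_size != int(fields["size"]):
        return False

    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte weights do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_digest


# Pointer for the new training_args.bin, copied from the diff above.
NEW_TRAINING_ARGS_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:06680ede8d19730a3c261c73cdcef4aff2ba6f296c78a6c12bc7f6b36c85fe8e
size 4283
"""
```

Calling `matches_pointer("training_args.bin", NEW_TRAINING_ARGS_POINTER)` on a local copy (the path is an assumption) would confirm that the download corresponds to this commit rather than to the parent.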