metadata
tags:
  - generated_from_trainer
datasets:
  - mt_eng_vietnamese
metrics:
  - bleu
model-index:
  - name: basso4_eng_to_vie_model
    results:
      - task:
          name: Sequence-to-sequence Language Modeling
          type: text2text-generation
        dataset:
          name: mt_eng_vietnamese
          type: mt_eng_vietnamese
          config: iwslt2015-en-vi
          split: train
          args: iwslt2015-en-vi
        metrics:
          - name: Bleu
            type: bleu
            value: 1.3777

basso4_eng_to_vie_model

This model was trained from scratch on the mt_eng_vietnamese dataset (iwslt2015-en-vi configuration). After the final epoch, it achieves the following results on the evaluation set:

  • Loss: 0.9999
  • Bleu: 1.3777
  • Gen Len: 18.7953

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 10
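
As a rough illustration of what the linear scheduler implies for these settings, the sketch below computes the learning rate at a given optimizer step. It assumes zero warmup steps (the card does not report any) and takes the total of 66,660 steps from the final row of the training results. The names `HYPERPARAMS`, `TOTAL_STEPS`, `WARMUP_STEPS`, and `linear_lr` are illustrative and do not come from the original training script.

```python
# Hyperparameters copied from the list above.
HYPERPARAMS = {
    "learning_rate": 2e-05,
    "train_batch_size": 16,
    "eval_batch_size": 16,
    "seed": 42,
    "adam_betas": (0.9, 0.999),
    "adam_epsilon": 1e-08,
    "lr_scheduler_type": "linear",
    "num_epochs": 10,
}

# Total optimizer steps, taken from the last row of the training results.
TOTAL_STEPS = 66660

# Assumption: no warmup, since the card does not list any warmup steps.
WARMUP_STEPS = 0


def linear_lr(step, base_lr=HYPERPARAMS["learning_rate"],
              total_steps=TOTAL_STEPS, warmup_steps=WARMUP_STEPS):
    """Learning rate at `step`: linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # 6666 steps per epoch, as logged in the training results.
    for epoch in (0, 5, 10):
        step = epoch * 6666
        print(f"epoch {epoch:2d} (step {step:5d}): lr = {linear_lr(step):.2e}")
```

Under these assumptions the learning rate falls from 2e-05 at the start to half that value at the midpoint of training and reaches zero at the final step.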

Training results

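| Training Loss | Epoch | Step  | Validation Loss | Bleu   | Gen Len |
|---------------|-------|-------|-----------------|--------|---------|
| 1.3485        | 1.0   | 6666  | 1.2151          | 0.9294 | 18.7893 |
| 1.2969        | 2.0   | 13332 | 1.1510          | 1.0593 | 18.7881 |
| 1.2426        | 3.0   | 19998 | 1.1053          | 1.1598 | 18.7873 |
| 1.2067        | 4.0   | 26664 | 1.0713          | 1.2393 | 18.789  |
| 1.1809        | 5.0   | 33330 | 1.0469          | 1.2853 | 18.7894 |
| 1.1575        | 6.0   | 39996 | 1.0288          | 1.3227 | 18.7899 |
| 1.1407        | 7.0   | 46662 | 1.0154          | 1.3405 | 18.7937 |
| 1.1295        | 8.0   | 53328 | 1.0071          | 1.3657 | 18.7943 |
| 1.1241        | 9.0   | 59994 | 1.0014          | 1.3753 | 18.7953 |
| 1.1209        | 10.0  | 66660 | 0.9999          | 1.3777 | 18.7953 |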
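
To make the trend in these results easier to read, the following sketch recomputes the per-epoch changes in validation loss and BLEU from the logged values. The figures are copied from the training results; the helper name `epoch_deltas` is illustrative. It also derives an upper bound on the number of training examples seen per epoch, under the assumption of a single device and no gradient accumulation (neither is stated in the card).

```python
# (epoch, step, validation_loss, bleu) copied from the training results.
RESULTS = [
    (1, 6666, 1.2151, 0.9294),
    (2, 13332, 1.1510, 1.0593),
    (3, 19998, 1.1053, 1.1598),
    (4, 26664, 1.0713, 1.2393),
    (5, 33330, 1.0469, 1.2853),
    (6, 39996, 1.0288, 1.3227),
    (7, 46662, 1.0154, 1.3405),
    (8, 53328, 1.0071, 1.3657),
    (9, 59994, 1.0014, 1.3753),
    (10, 66660, 0.9999, 1.3777),
]

TRAIN_BATCH_SIZE = 16  # train_batch_size from the hyperparameter list


def epoch_deltas(results):
    """Return (epoch, loss_change, bleu_change) for each epoch after the first."""
    deltas = []
    for prev, curr in zip(results, results[1:]):
        loss_change = round(curr[2] - prev[2], 4)
        bleu_change = round(curr[3] - prev[3], 4)
        deltas.append((curr[0], loss_change, bleu_change))
    return deltas


# The step counter advances by the same amount every epoch.
steps_per_epoch = RESULTS[0][1]

# Assumption: one device and no gradient accumulation, so one step = one batch.
max_examples_per_epoch = steps_per_epoch * TRAIN_BATCH_SIZE

if __name__ == "__main__":
    print(f"steps per epoch: {steps_per_epoch}")
    print(f"max examples per epoch (assumed): {max_examples_per_epoch}")
    for epoch, loss_change, bleu_change in epoch_deltas(RESULTS):
        print(f"epoch {epoch:2d}: loss {loss_change:+.4f}, BLEU {bleu_change:+.4f}")
```

On these numbers, the BLEU gain per epoch shrinks from about 0.13 between epochs 1 and 2 to about 0.002 between epochs 9 and 10, while validation loss is still decreasing slightly at the end of training.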

Framework versions

  • Transformers 4.33.0
  • PyTorch 2.0.0
  • Datasets 2.1.0
  • Tokenizers 0.13.3
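
For anyone trying to reproduce this environment, a minimal sketch for comparing locally installed packages against the versions listed above is given here. It uses only the standard library; the names `PINNED` and `compare_versions` are illustrative, and the mapping of each framework to a PyPI distribution name (in particular PyTorch to `torch`) is an assumption rather than something stated in the card.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed in the card, keyed by PyPI distribution name.
# Assumption: PyTorch is installed from the "torch" distribution.
PINNED = {
    "transformers": "4.33.0",
    "torch": "2.0.0",
    "datasets": "2.1.0",
    "tokenizers": "0.13.3",
}


def compare_versions(pinned):
    """Map each package to (expected, installed); installed is None if absent."""
    report = {}
    for name, expected in pinned.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            installed = None
        report[name] = (expected, installed)
    return report


if __name__ == "__main__":
    for name, (expected, installed) in compare_versions(PINNED).items():
        if installed is None:
            status = "not installed"
        elif installed == expected:
            status = "matches"
        else:
            status = f"differs (installed {installed})"
        print(f"{name}: expected {expected}, {status}")
```

Note that some builds report a version with a local suffix (for example a CUDA tag appended to the PyTorch version), so an exact string comparison may flag a difference even when the release number is the same.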