metadata
license: apache-2.0
base_model: t5-small
tags:
  - summarization
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: t5-small-finetuned-cnn-news
    results: []

t5-small-finetuned-cnn-news

This model is a fine-tuned version of t5-small for summarization. The training dataset is not recorded in this card. After the final epoch (step 3590), it achieves the following results on the evaluation set:

  • Loss: 1.8421
  • Rouge1: 24.4309
  • Rouge2: 12.1268
  • Rougel: 20.3697
  • Rougelsum: 23.18
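
For readers unfamiliar with the metric, the sketch below shows roughly what a ROUGE-1 score measures: clipped unigram overlap between a reference summary and a generated one, reported as an F-measure on a 0 to 100 scale. It is a simplified illustration, not the evaluation code used for this model. The function name `rouge1_f` and the example sentences are made up here, and real ROUGE implementations apply tokenization and stemming rules that this version omits.

```python
from collections import Counter


def rouge1_f(reference: str, candidate: str) -> float:
    """Simplified ROUGE-1 F-measure (0-100) using whitespace tokens."""
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(candidate.lower().split())

    # Clipped overlap: each unigram counts at most as often as it
    # appears in both texts.
    overlap = sum((ref_counts & cand_counts).values())
    if overlap == 0:
        return 0.0

    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 100 * 2 * precision * recall / (precision + recall)


# Hypothetical example pair: 5 of 6 unigrams match in each direction.
score = rouge1_f("the cat sat on the mat", "the cat lay on the mat")
print(f"{score:.2f}")  # 83.33
```

Rouge2 applies the same idea to bigrams, and Rougel and Rougelsum are based on the longest common subsequence rather than fixed-size n-grams.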

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.00056
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 5
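
As an illustration of what `lr_scheduler_type: linear` implies for these settings, the sketch below computes the learning rate at the end of each epoch. It assumes zero warmup steps (the card lists no warmup setting) and takes the total of 3590 optimizer steps from the final row of the training results table. The helper `linear_lr` is a hypothetical name written for this example, not a library function.

```python
BASE_LR = 0.00056    # learning_rate from the hyperparameter list
TOTAL_STEPS = 3590   # last step in the training results table
WARMUP_STEPS = 0     # assumption: no warmup is listed in the card


def linear_lr(step: int,
              base_lr: float = BASE_LR,
              total_steps: int = TOTAL_STEPS,
              warmup_steps: int = WARMUP_STEPS) -> float:
    """Linear warmup (if any) followed by linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the end of each of the 5 epochs (718 steps per epoch).
for epoch in range(1, 6):
    step = 718 * epoch
    print(f"epoch {epoch}: step {step}, lr = {linear_lr(step):.6f}")
```

Under these assumptions the rate falls by one fifth of its starting value each epoch and reaches zero at step 3590.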

Training results

Training Loss  Epoch  Step  Validation Loss  Rouge1   Rouge2   Rougel   Rougelsum
2.0267         1.0    718   1.8134           24.5086  12.0372  20.3241  23.2338
1.8289         2.0    1436  1.8150           24.4861  12.1833  20.5262  23.3358
1.6833         3.0    2154  1.8148           23.9202  11.7941  19.9514  22.7185
1.576          4.0    2872  1.8271           24.2367  11.8778  20.1292  23.0104
1.4965         5.0    3590  1.8421           24.4309  12.1268  20.3697  23.18
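
One way to read the table is to ask which epoch performed best on each measure. The sketch below copies the rows above into a list and finds the epoch with the lowest validation loss and the highest value of each ROUGE metric. The variable names are chosen for this example, and the training-set estimate at the end assumes a single device and no gradient accumulation, neither of which the card states.

```python
# (epoch, training_loss, validation_loss, rouge1, rouge2, rougel, rougelsum)
RESULTS = [
    (1, 2.0267, 1.8134, 24.5086, 12.0372, 20.3241, 23.2338),
    (2, 1.8289, 1.8150, 24.4861, 12.1833, 20.5262, 23.3358),
    (3, 1.6833, 1.8148, 23.9202, 11.7941, 19.9514, 22.7185),
    (4, 1.576, 1.8271, 24.2367, 11.8778, 20.1292, 23.0104),
    (5, 1.4965, 1.8421, 24.4309, 12.1268, 20.3697, 23.18),
]
METRIC_COLUMNS = {"Rouge1": 3, "Rouge2": 4, "Rougel": 5, "Rougelsum": 6}

# Epoch with the lowest validation loss.
best_val_epoch = min(RESULTS, key=lambda row: row[2])[0]

# Epoch with the highest score for each ROUGE metric.
best_epoch_by_metric = {
    name: max(RESULTS, key=lambda row: row[col])[0]
    for name, col in METRIC_COLUMNS.items()
}

# Does training loss fall at every epoch?
train_loss_decreasing = all(
    later[1] < earlier[1] for earlier, later in zip(RESULTS, RESULTS[1:])
)

# Rough upper bound on training examples: steps per epoch * batch size.
# Assumes one device and no gradient accumulation (not stated in the card).
max_train_examples = 718 * 8

print("lowest validation loss at epoch", best_val_epoch)
print("best epoch per metric:", best_epoch_by_metric)
print("training loss strictly decreasing:", train_loss_decreasing)
print("training examples (upper bound):", max_train_examples)
```

Training loss falls at every epoch while validation loss is lowest after epoch 1 and highest after epoch 5, and no ROUGE metric peaks later than epoch 2. This pattern is consistent with overfitting in the later epochs, although the ROUGE differences between epochs are small.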

Framework versions

  • Transformers 4.42.4
  • Pytorch 2.3.1+cu121
  • Datasets 2.20.0
  • Tokenizers 0.19.1