cu-go-bart-large-gc / README.md
sammyj4148's picture
Model save
e2c8fd6
metadata
license: apache-2.0
base_model: facebook/bart-large
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: cu-go-bart-large-gc
    results: []

cu-go-bart-large-gc

This model is a fine-tuned version of facebook/bart-large on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 1.3380
  • Rouge1: 56.6424
  • Rouge2: 31.6294
  • Rougel: 38.8938
  • Rougelsum: 51.9078
  • Gen Len: 119.4535

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 4
  • eval_batch_size: 4
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 3.0

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum Gen Len
No log 1.0 86 1.3532 54.8564 29.5263 36.6465 50.2558 116.6512
No log 2.0 172 1.3118 56.6239 31.6121 39.2945 51.7651 117.9419
No log 3.0 258 1.3380 56.6424 31.6294 38.8938 51.9078 119.4535

Framework versions

  • Transformers 4.35.0.dev0
  • Pytorch 2.0.1+cu117
  • Datasets 2.12.0
  • Tokenizers 0.14.1