Gen_Z_Model / README.md
GCruz19's picture
update model card README.md
3093ef4
|
raw
history blame
4.84 kB
metadata
license: apache-2.0
base_model: t5-small
tags:
  - generated_from_trainer
metrics:
  - bleu
model-index:
  - name: Gen_Z_Model
    results: []

Gen_Z_Model

This model is a fine-tuned version of t5-small on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.2083
  • Bleu: 38.8455
  • Gen Len: 15.0467

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 50

Training results

Training Loss Epoch Step Validation Loss Bleu Gen Len
No log 1.0 107 1.9909 28.2199 15.1893
No log 2.0 214 1.7933 32.7292 15.2734
No log 3.0 321 1.7042 33.0586 15.3575
No log 4.0 428 1.6409 33.5589 15.3294
1.9663 5.0 535 1.5944 34.0231 15.3084
1.9663 6.0 642 1.5542 34.5356 15.2453
1.9663 7.0 749 1.5204 34.5257 15.3178
1.9663 8.0 856 1.4949 35.0464 15.2664
1.9663 9.0 963 1.4656 34.8031 15.3692
1.563 10.0 1070 1.4452 34.8213 15.3248
1.563 11.0 1177 1.4273 34.8319 15.3715
1.563 12.0 1284 1.4041 34.6139 15.528
1.563 13.0 1391 1.3904 34.8305 15.4439
1.563 14.0 1498 1.3747 35.4972 15.5327
1.4209 15.0 1605 1.3619 35.7394 15.4322
1.4209 16.0 1712 1.3493 35.6452 15.4206
1.4209 17.0 1819 1.3369 35.8997 15.4276
1.4209 18.0 1926 1.3255 35.8844 15.4416
1.3222 19.0 2033 1.3168 35.8468 15.465
1.3222 20.0 2140 1.3074 36.3525 15.3621
1.3222 21.0 2247 1.2993 37.2694 15.2453
1.3222 22.0 2354 1.2925 37.3457 15.2593
1.3222 23.0 2461 1.2842 37.3279 15.236
1.2566 24.0 2568 1.2805 37.4183 15.2056
1.2566 25.0 2675 1.2750 37.7844 15.1939
1.2566 26.0 2782 1.2684 37.8613 15.1799
1.2566 27.0 2889 1.2626 37.8746 15.1519
1.2566 28.0 2996 1.2562 38.017 15.1495
1.1991 29.0 3103 1.2536 38.1961 15.1145
1.1991 30.0 3210 1.2473 38.2285 15.0981
1.1991 31.0 3317 1.2429 38.214 15.1028
1.1991 32.0 3424 1.2397 38.5427 15.0467
1.1655 33.0 3531 1.2353 38.2303 15.1121
1.1655 34.0 3638 1.2344 38.5399 15.1285
1.1655 35.0 3745 1.2288 38.4536 15.1005
1.1655 36.0 3852 1.2263 38.7325 15.0794
1.1655 37.0 3959 1.2237 38.7098 15.1051
1.1306 38.0 4066 1.2202 38.6696 15.1215
1.1306 39.0 4173 1.2182 38.8038 15.0771
1.1306 40.0 4280 1.2171 38.846 15.0561
1.1306 41.0 4387 1.2162 38.7233 15.0257
1.1306 42.0 4494 1.2144 38.7516 15.0327
1.1103 43.0 4601 1.2136 39.1562 15.0304
1.1103 44.0 4708 1.2115 38.9924 15.021
1.1103 45.0 4815 1.2104 39.0094 15.035
1.1103 46.0 4922 1.2097 38.9355 15.0421
1.0979 47.0 5029 1.2087 38.8939 15.0561
1.0979 48.0 5136 1.2087 38.8412 15.0491
1.0979 49.0 5243 1.2084 38.8575 15.0561
1.0979 50.0 5350 1.2083 38.8455 15.0467

Framework versions

  • Transformers 4.31.0
  • Pytorch 2.0.1+cu118
  • Datasets 2.14.3
  • Tokenizers 0.13.3