haryoaw's picture
Initial Commit
39af9ed verified
metadata
license: mit
base_model: haryoaw/scenario-MDBT-TCR_data-cl-cardiff_cl_only
tags:
  - generated_from_trainer
metrics:
  - accuracy
  - f1
model-index:
  - name: scenario-KD-PO-CDF-EN-FROM-CL-D2_data-en-cardiff_eng_only55
    results: []

scenario-KD-PO-CDF-EN-FROM-CL-D2_data-en-cardiff_eng_only55

This model is a fine-tuned version of haryoaw/scenario-MDBT-TCR_data-cl-cardiff_cl_only on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 39.4604
  • Accuracy: 0.4537
  • F1: 0.4426

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 32
  • eval_batch_size: 32
  • seed: 55
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 30

Training results

Training Loss Epoch Step Validation Loss Accuracy F1
No log 1.72 100 24.2103 0.4616 0.4518
No log 3.45 200 29.3271 0.4255 0.3714
No log 5.17 300 29.8979 0.4489 0.4414
No log 6.9 400 31.7211 0.4669 0.4627
15.3839 8.62 500 37.0421 0.4581 0.4492
15.3839 10.34 600 32.5884 0.4669 0.4658
15.3839 12.07 700 39.9517 0.4493 0.4332
15.3839 13.79 800 38.5249 0.4630 0.4470
15.3839 15.52 900 38.7918 0.4414 0.4286
2.5437 17.24 1000 40.0345 0.4524 0.4391
2.5437 18.97 1100 38.3918 0.4612 0.4527
2.5437 20.69 1200 41.3974 0.4396 0.4169
2.5437 22.41 1300 38.7372 0.4603 0.4532
2.5437 24.14 1400 40.1541 0.4405 0.4288
1.0429 25.86 1500 40.0459 0.4568 0.4383
1.0429 27.59 1600 39.3779 0.4590 0.4457
1.0429 29.31 1700 39.4604 0.4537 0.4426

Framework versions

  • Transformers 4.33.3
  • Pytorch 2.1.1+cu121
  • Datasets 2.14.5
  • Tokenizers 0.13.3