wav2vec2_ASV_deepfake_audio_detection

This model is a fine-tuned version of facebook/wav2vec2-base on an unknown dataset. At the final logged evaluation (step 430, epoch 4.9143), it achieves the following results on the evaluation set:

  • Loss: 0.5628
  • Accuracy: 0.8999
  • Precision: 0.9057
  • F1: 0.8612
  • True positives (TP): 181
  • True negatives (TN): 16068
  • False negatives (FN): 1800
  • False positives (FP): 8
  • ROC AUC: 0.9372
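
The card does not say how Precision and F1 are averaged. The sketch below recomputes them from the confusion-matrix counts listed above. The reported values match support-weighted averages over both classes rather than positive-class scores; that is an inference from the numbers, not something the card states. Which label (for example spoofed or genuine audio) counts as "positive" is also not stated, so the code only speaks of a positive and a negative class.

```python
# Confusion-matrix counts copied from the evaluation results above.
tp, tn, fn, fp = 181, 16068, 1800, 8

pos_support = tp + fn          # number of actual positives
neg_support = tn + fp          # number of actual negatives
total = pos_support + neg_support

accuracy = (tp + tn) / total


def prf(hits, false_alarms, misses):
    """Precision, recall and F1 for one class."""
    precision = hits / (hits + false_alarms)
    recall = hits / (hits + misses)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Positive class: hits are TP, false alarms are FP, misses are FN.
pos_p, pos_r, pos_f1 = prf(tp, fp, fn)
# Negative class: hits are TN, false alarms are FN, misses are FP.
neg_p, neg_r, neg_f1 = prf(tn, fn, fp)

# Support-weighted averages (assumed averaging mode, inferred from the numbers).
weighted_precision = (pos_p * pos_support + neg_p * neg_support) / total
weighted_f1 = (pos_f1 * pos_support + neg_f1 * neg_support) / total

print(f"accuracy           {accuracy:.4f}")
print(f"weighted precision {weighted_precision:.4f}")
print(f"weighted F1        {weighted_f1:.4f}")
print(f"positive recall    {pos_r:.4f}")
```

Under this reading, the high weighted scores are dominated by the much larger negative class, while recall on the positive class is low (181 of 1981 examples).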

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 100
  • eval_batch_size: 100
  • seed: 42
  • gradient_accumulation_steps: 4
  • total_train_batch_size: 400
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 5
  • mixed_precision_training: Native AMP
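
As a rough illustration of how these settings fit together, the sketch below collects them in a dictionary whose keys follow `transformers.TrainingArguments` parameter names, derives the effective batch size, and evaluates a linear warmup-then-decay schedule in plain Python. Several points are assumptions rather than facts from the card: a single training device, `fp16=True` as the meaning of "Native AMP", and a total of 430 optimizer steps (the last step in the training log; the true total is not stated). `linear_lr` is a hypothetical helper, not a library function.

```python
import math

# Values from the hyperparameter list above; keys follow TrainingArguments names.
training_kwargs = {
    "learning_rate": 5e-5,
    "per_device_train_batch_size": 100,   # assumes train_batch_size is per device
    "per_device_eval_batch_size": 100,
    "seed": 42,
    "gradient_accumulation_steps": 4,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-8,
    "lr_scheduler_type": "linear",
    "warmup_ratio": 0.1,
    "num_train_epochs": 5,
    "fp16": True,                          # assumed mapping for "Native AMP"
}

NUM_DEVICES = 1  # assumption: single-device training

effective_batch_size = (
    training_kwargs["per_device_train_batch_size"]
    * training_kwargs["gradient_accumulation_steps"]
    * NUM_DEVICES
)


def linear_lr(step, total_steps,
              peak_lr=training_kwargs["learning_rate"],
              warmup_ratio=training_kwargs["warmup_ratio"]):
    """Learning rate with linear warmup to peak_lr, then linear decay to zero."""
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    remaining = total_steps - step
    return peak_lr * max(0.0, remaining / max(1, total_steps - warmup_steps))


TOTAL_STEPS = 430  # assumption: last logged step, used only for illustration

print("effective batch size:", effective_batch_size)
for s in (0, 10, 43, 210, 430):
    print(f"step {s:>3}: lr = {linear_lr(s, TOTAL_STEPS):.2e}")
```

With one device, 100 samples per step and 4 accumulation steps reproduce the listed total_train_batch_size of 400.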

Training results

| Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | F1 | TP | TN | FN | FP | ROC AUC |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.693 | 0.1143 | 10 | 0.6628 | 0.8854 | 0.8117 | 0.8385 | 23 | 15964 | 1958 | 112 | 0.5001 |
| 0.6589 | 0.2286 | 20 | 0.4915 | 0.8903 | 0.7926 | 0.8386 | 0 | 16076 | 1981 | 0 | 0.5030 |
| 0.5546 | 0.3429 | 30 | 0.3825 | 0.8865 | 0.8231 | 0.8406 | 39 | 15969 | 1942 | 107 | 0.5748 |
| 0.3566 | 0.4571 | 40 | 0.3403 | 0.8909 | 0.8620 | 0.8419 | 28 | 16059 | 1953 | 17 | 0.6201 |
| 0.2115 | 0.5714 | 50 | 0.3617 | 0.8923 | 0.8908 | 0.8442 | 43 | 16070 | 1938 | 6 | 0.7028 |
| 0.1636 | 0.6857 | 60 | 0.3428 | 0.8958 | 0.8756 | 0.8586 | 182 | 15993 | 1799 | 83 | 0.7968 |
| 0.1415 | 0.8 | 70 | 0.3899 | 0.8925 | 0.9015 | 0.8440 | 41 | 16075 | 1940 | 1 | 0.6722 |
| 0.11 | 0.9143 | 80 | 0.3756 | 0.8930 | 0.9024 | 0.8452 | 50 | 16075 | 1931 | 1 | 0.7490 |
| 0.1041 | 1.0286 | 90 | 0.3885 | 0.8960 | 0.9006 | 0.8526 | 110 | 16069 | 1871 | 7 | 0.6362 |
| 0.0888 | 1.1429 | 100 | 0.3484 | 0.8995 | 0.8936 | 0.8630 | 207 | 16036 | 1774 | 40 | 0.8231 |
| 0.0669 | 1.2571 | 110 | 0.3386 | 0.9049 | 0.9040 | 0.8734 | 299 | 16041 | 1682 | 35 | 0.8354 |
| 0.0552 | 1.3714 | 120 | 0.4530 | 0.8942 | 0.9055 | 0.8480 | 71 | 16076 | 1910 | 0 | 0.8554 |
| 0.071 | 1.4857 | 130 | 0.4327 | 0.8963 | 0.8937 | 0.8545 | 128 | 16057 | 1853 | 19 | 0.8543 |
| 0.0665 | 1.6 | 140 | 0.4547 | 0.8947 | 0.9045 | 0.8491 | 80 | 16075 | 1901 | 1 | 0.8065 |
| 0.054 | 1.7143 | 150 | 0.3210 | 0.9148 | 0.9064 | 0.8970 | 592 | 15926 | 1389 | 150 | 0.8851 |
| 0.0575 | 1.8286 | 160 | 0.4901 | 0.8934 | 0.9012 | 0.8462 | 58 | 16074 | 1923 | 2 | 0.7591 |
| 0.0437 | 1.9429 | 170 | 0.4849 | 0.8979 | 0.9036 | 0.8568 | 144 | 16069 | 1837 | 7 | 0.6435 |
| 0.0471 | 2.0571 | 180 | 0.3822 | 0.9071 | 0.9103 | 0.8767 | 324 | 16056 | 1657 | 20 | 0.9277 |
| 0.0377 | 2.1714 | 190 | 0.5301 | 0.8928 | 0.8962 | 0.8450 | 49 | 16072 | 1932 | 4 | 0.9112 |
| 0.0327 | 2.2857 | 200 | 0.5534 | 0.8920 | 0.9036 | 0.8426 | 30 | 16076 | 1951 | 0 | 0.8755 |
| 0.0522 | 2.4 | 210 | 0.2332 | 0.9260 | 0.9192 | 0.9162 | 865 | 15856 | 1116 | 220 | 0.9448 |
| 0.0449 | 2.5143 | 220 | 0.3034 | 0.9102 | 0.9104 | 0.8835 | 397 | 16038 | 1584 | 38 | 0.9453 |
| 0.0338 | 2.6286 | 230 | 0.4001 | 0.9018 | 0.9072 | 0.8654 | 218 | 16066 | 1763 | 10 | 0.9153 |
| 0.0337 | 2.7429 | 240 | 0.4761 | 0.8973 | 0.9056 | 0.8552 | 130 | 16073 | 1851 | 3 | 0.8789 |
| 0.0347 | 2.8571 | 250 | 0.5613 | 0.8921 | 0.9037 | 0.8429 | 32 | 16076 | 1949 | 0 | 0.9068 |
| 0.0301 | 2.9714 | 260 | 0.4896 | 0.8967 | 0.9025 | 0.8540 | 121 | 16070 | 1860 | 6 | 0.9480 |
| 0.0208 | 3.0857 | 270 | 0.5223 | 0.8983 | 0.9053 | 0.8575 | 149 | 16071 | 1832 | 5 | 0.9471 |
| 0.0197 | 3.2 | 280 | 0.5003 | 0.9024 | 0.9068 | 0.8669 | 232 | 16063 | 1749 | 13 | 0.9445 |
| 0.0167 | 3.3143 | 290 | 0.4328 | 0.9087 | 0.9123 | 0.8796 | 351 | 16057 | 1630 | 19 | 0.9561 |
| 0.0235 | 3.4286 | 300 | 0.3612 | 0.9097 | 0.9115 | 0.8821 | 380 | 16047 | 1601 | 29 | 0.9596 |
| 0.0207 | 3.5429 | 310 | 0.3538 | 0.9158 | 0.9169 | 0.8934 | 498 | 16038 | 1483 | 38 | 0.9591 |
| 0.0192 | 3.6571 | 320 | 0.4185 | 0.9145 | 0.9171 | 0.8907 | 465 | 16049 | 1516 | 27 | 0.9404 |
| 0.0176 | 3.7714 | 330 | 0.6594 | 0.8926 | 0.9017 | 0.8443 | 43 | 16075 | 1938 | 1 | 0.8734 |
| 0.0174 | 3.8857 | 340 | 0.5727 | 0.8995 | 0.9073 | 0.8600 | 170 | 16072 | 1811 | 4 | 0.9276 |
| 0.021 | 4.0 | 350 | 0.5943 | 0.8937 | 0.8988 | 0.8471 | 65 | 16072 | 1916 | 4 | 0.9460 |
| 0.02 | 4.1143 | 360 | 0.5183 | 0.8982 | 0.9040 | 0.8574 | 149 | 16069 | 1832 | 7 | 0.9507 |
| 0.015 | 4.2286 | 370 | 0.5329 | 0.8980 | 0.9037 | 0.8570 | 146 | 16069 | 1835 | 7 | 0.9477 |
| 0.0139 | 4.3429 | 380 | 0.5545 | 0.8967 | 0.9017 | 0.8541 | 122 | 16069 | 1859 | 7 | 0.9438 |
| 0.0103 | 4.4571 | 390 | 0.5638 | 0.8969 | 0.9021 | 0.8546 | 126 | 16069 | 1855 | 7 | 0.9403 |
| 0.0099 | 4.5714 | 400 | 0.5094 | 0.9030 | 0.9078 | 0.8679 | 241 | 16064 | 1740 | 12 | 0.9419 |
| 0.0121 | 4.6857 | 410 | 0.5066 | 0.9049 | 0.9099 | 0.8717 | 275 | 16064 | 1706 | 12 | 0.9406 |
| 0.0122 | 4.8 | 420 | 0.5700 | 0.8992 | 0.9047 | 0.8596 | 168 | 16068 | 1813 | 8 | 0.9326 |
| 0.0155 | 4.9143 | 430 | 0.5628 | 0.8999 | 0.9057 | 0.8612 | 181 | 16068 | 1800 | 8 | 0.9372 |
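
The log above shows that the final checkpoint (step 430) is not the best on every metric: across all rows, step 210 has the lowest validation loss and step 300 the highest ROC AUC. The sketch below illustrates that comparison on a handful of rows copied from the table. It is only an example of how one might pick a checkpoint from such a log; the card does not say which checkpoint was published or how it was selected.

```python
# A few rows copied from the training log above:
# (step, validation_loss, true_positives, false_negatives, roc_auc)
rows = [
    (150, 0.3210, 592, 1389, 0.8851),
    (210, 0.2332, 865, 1116, 0.9448),
    (300, 0.3612, 380, 1601, 0.9596),
    (310, 0.3538, 498, 1483, 0.9591),
    (430, 0.5628, 181, 1800, 0.9372),
]


def positive_recall(tp, fn):
    """Share of actual positives that were predicted positive."""
    return tp / (tp + fn)


best_by_loss = min(rows, key=lambda r: r[1])   # lowest validation loss
best_by_auc = max(rows, key=lambda r: r[4])    # highest ROC AUC

print("lowest validation loss at step", best_by_loss[0])
print("highest ROC AUC at step", best_by_auc[0])
for step, loss, tp, fn, auc in rows:
    print(f"step {step}: loss={loss:.4f} auc={auc:.4f} "
          f"positive recall={positive_recall(tp, fn):.4f}")
```

The training loss keeps falling while the validation loss rises after roughly step 210, so comparing checkpoints on validation metrics, as sketched here, may be more informative than taking the last one.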

Framework versions

  • Transformers 4.44.1
  • PyTorch 2.2.1+cu121
  • Datasets 2.21.0
  • Tokenizers 0.19.1

Model size

  • Parameters: 94.6M
  • Tensor type: F32 (Safetensors)