Committed by alex-miller
Commit bb1d6da
1 parent: 2bdd7a4

End of training

README.md CHANGED
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [alex-miller/ODABert](https://huggingface.co/alex-miller/ODABert) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.2017
- - Accuracy: 0.9643
- - F1: 0.9630
- - Precision: 0.9286
- - Recall: 1.0

  ## Model description

@@ -43,9 +43,9 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 1e-05
- - train_batch_size: 16
- - eval_batch_size: 16
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
@@ -55,26 +55,26 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
- | 0.6608 | 1.0 | 7 | 0.5726 | 0.7857 | 0.7857 | 0.7333 | 0.8462 |
- | 0.5107 | 2.0 | 14 | 0.4116 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
- | 0.35 | 3.0 | 21 | 0.3056 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
- | 0.2166 | 4.0 | 28 | 0.2485 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
- | 0.1356 | 5.0 | 35 | 0.1940 | 0.9286 | 0.9231 | 0.9231 | 0.9231 |
- | 0.0705 | 6.0 | 42 | 0.1848 | 0.9286 | 0.9231 | 0.9231 | 0.9231 |
- | 0.0491 | 7.0 | 49 | 0.1728 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0229 | 8.0 | 56 | 0.1725 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0129 | 9.0 | 63 | 0.1744 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0092 | 10.0 | 70 | 0.1793 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0075 | 11.0 | 77 | 0.1847 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0061 | 12.0 | 84 | 0.1890 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0058 | 13.0 | 91 | 0.1928 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0049 | 14.0 | 98 | 0.1954 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0045 | 15.0 | 105 | 0.1975 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0042 | 16.0 | 112 | 0.1990 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.004 | 17.0 | 119 | 0.2001 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0038 | 18.0 | 126 | 0.2010 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0038 | 19.0 | 133 | 0.2015 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
- | 0.0038 | 20.0 | 140 | 0.2017 | 0.9643 | 0.9630 | 0.9286 | 1.0 |


  ### Framework versions
 

  This model is a fine-tuned version of [alex-miller/ODABert](https://huggingface.co/alex-miller/ODABert) on an unknown dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 0.1854
+ - Accuracy: 0.9286
+ - F1: 0.9231
+ - Precision: 0.9231
+ - Recall: 0.9231
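
The reported accuracy, precision, recall and F1 follow the standard binary-classification definitions, so they can be cross-checked against a confusion matrix. The sketch below is illustrative only: the counts are hypothetical, inferred by assuming a 28-example evaluation set with 13 positive examples, which the source does not state. Under that assumption the figures from both the removed and the added lines are reproduced exactly.

```python
def classification_metrics(tp, fp, fn, tn):
    """Return accuracy, precision, recall and F1 for a binary confusion matrix."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    values = {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
    # Round to four decimals, matching the precision used in the model card.
    return {name: round(value, 4) for name, value in values.items()}


# Hypothetical counts (not taken from the source): 28 evaluation examples,
# 13 of them positive. These are assumptions chosen to match the card.
previous_run = classification_metrics(tp=13, fp=1, fn=0, tn=14)
this_run = classification_metrics(tp=12, fp=1, fn=1, tn=14)

print(previous_run)
print(this_run)
```

If the assumption holds, the drop from 0.9643 to 0.9286 accuracy corresponds to a single additional misclassified example, which is worth keeping in mind when comparing runs on an evaluation set this small.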

  ## Model description

 
  ### Training hyperparameters

  The following hyperparameters were used during training:
+ - learning_rate: 5e-06
+ - train_batch_size: 32
+ - eval_batch_size: 32
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
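
For orientation, a linear scheduler decays the learning rate from its initial value to zero over the course of training. The sketch below is a plain-Python approximation and not the training code used for this commit. The function name `linear_lr` is hypothetical, `total_steps=80` is taken from the final step in the added training log, and `warmup_steps=0` is an assumption because this hunk does not show any warmup setting.

```python
def linear_lr(step, base_lr=5e-06, total_steps=80, warmup_steps=0):
    """Learning rate at a given optimizer step under linear warmup and decay.

    warmup_steps=0 is an assumption; the diff does not show a warmup value.
    """
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Linear decay from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the start, the midpoint and the end of training.
for step in (0, 40, 80):
    print(step, linear_lr(step))
```

Under these assumptions the learning rate is halved by step 40 (epoch 10), which may help explain why the validation loss in the new run is still falling slowly at epoch 20.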
 

  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
+ | 0.6851 | 1.0 | 4 | 0.6649 | 0.7143 | 0.7333 | 0.6471 | 0.8462 |
+ | 0.6437 | 2.0 | 8 | 0.6270 | 0.7857 | 0.7857 | 0.7333 | 0.8462 |
+ | 0.6076 | 3.0 | 12 | 0.5887 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
+ | 0.5637 | 4.0 | 16 | 0.5442 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
+ | 0.5203 | 5.0 | 20 | 0.4945 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
+ | 0.459 | 6.0 | 24 | 0.4436 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.4076 | 7.0 | 28 | 0.3976 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.3561 | 8.0 | 32 | 0.3545 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.3095 | 9.0 | 36 | 0.3180 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.2629 | 10.0 | 40 | 0.2887 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.2282 | 11.0 | 44 | 0.2677 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.2095 | 12.0 | 48 | 0.2493 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.1808 | 13.0 | 52 | 0.2326 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.1466 | 14.0 | 56 | 0.2186 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.1408 | 15.0 | 60 | 0.2075 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.1191 | 16.0 | 64 | 0.1993 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.1185 | 17.0 | 68 | 0.1935 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.1084 | 18.0 | 72 | 0.1891 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
+ | 0.1119 | 19.0 | 76 | 0.1865 | 0.9286 | 0.9231 | 0.9231 | 0.9231 |
+ | 0.1017 | 20.0 | 80 | 0.1854 | 0.9286 | 0.9231 | 0.9231 | 0.9231 |
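
The step counts in the two logs (7 steps per epoch at batch size 16, 4 steps per epoch at batch size 32) constrain the size of the training set, which the model card does not report. The sketch below works out the range of sizes consistent with both runs. It assumes single-device training, no gradient accumulation, and that the final partial batch is kept; none of these are confirmed by the source, and the helper names are hypothetical.

```python
import math


def steps_per_epoch(num_examples, batch_size):
    """Optimizer steps per epoch when the last partial batch is kept."""
    return math.ceil(num_examples / batch_size)


def sizes_matching(batch_size, steps, max_size=1000):
    """All training-set sizes up to max_size that give the observed step count."""
    return {
        n
        for n in range(1, max_size + 1)
        if steps_per_epoch(n, batch_size) == steps
    }


# Observed in the diff: 7 steps/epoch at batch size 16 (removed rows),
# 4 steps/epoch at batch size 32 (added rows).
candidate_sizes = sizes_matching(16, 7) & sizes_matching(32, 4)

print(min(candidate_sizes), max(candidate_sizes))
```

Under these assumptions the training set holds roughly a hundred examples, so each epoch in the new run amounts to only four parameter updates.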


  ### Framework versions
runs/Aug02_13-34-55_a0ae54856f2b/events.out.tfevents.1722605696.a0ae54856f2b.778.2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f2f430159e27125174725dee972b2d092a47a0ceaa79d29c6328582e36048c62
- size 18597
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:93fa6961ec6d875f252598f41aac9870a8924014ab7d14eb4e959bcaefa114ef
+ size 18945