alex-miller commited on
Commit
4b7faf7
1 Parent(s): 18e6845

End of training

Browse files
README.md CHANGED
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [alex-miller/ODABert](https://huggingface.co/alex-miller/ODABert) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.1854
24
- - Accuracy: 0.9286
25
- - F1: 0.9231
26
- - Precision: 0.9231
27
- - Recall: 0.9231
28
 
29
  ## Model description
30
 
@@ -43,7 +43,7 @@ More information needed
43
  ### Training hyperparameters
44
 
45
  The following hyperparameters were used during training:
46
- - learning_rate: 5e-06
47
  - train_batch_size: 32
48
  - eval_batch_size: 32
49
  - seed: 42
@@ -55,26 +55,26 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
58
- | 0.6851 | 1.0 | 4 | 0.6649 | 0.7143 | 0.7333 | 0.6471 | 0.8462 |
59
- | 0.6437 | 2.0 | 8 | 0.6270 | 0.7857 | 0.7857 | 0.7333 | 0.8462 |
60
- | 0.6076 | 3.0 | 12 | 0.5887 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
61
- | 0.5637 | 4.0 | 16 | 0.5442 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
62
- | 0.5203 | 5.0 | 20 | 0.4945 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
63
- | 0.459 | 6.0 | 24 | 0.4436 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
64
- | 0.4076 | 7.0 | 28 | 0.3976 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
65
- | 0.3561 | 8.0 | 32 | 0.3545 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
66
- | 0.3095 | 9.0 | 36 | 0.3180 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
67
- | 0.2629 | 10.0 | 40 | 0.2887 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
68
- | 0.2282 | 11.0 | 44 | 0.2677 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
69
- | 0.2095 | 12.0 | 48 | 0.2493 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
70
- | 0.1808 | 13.0 | 52 | 0.2326 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
71
- | 0.1466 | 14.0 | 56 | 0.2186 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
72
- | 0.1408 | 15.0 | 60 | 0.2075 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
73
- | 0.1191 | 16.0 | 64 | 0.1993 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
74
- | 0.1185 | 17.0 | 68 | 0.1935 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
75
- | 0.1084 | 18.0 | 72 | 0.1891 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
76
- | 0.1119 | 19.0 | 76 | 0.1865 | 0.9286 | 0.9231 | 0.9231 | 0.9231 |
77
- | 0.1017 | 20.0 | 80 | 0.1854 | 0.9286 | 0.9231 | 0.9231 | 0.9231 |
78
 
79
 
80
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [alex-miller/ODABert](https://huggingface.co/alex-miller/ODABert) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.1539
24
+ - Accuracy: 0.9643
25
+ - F1: 0.9630
26
+ - Precision: 0.9286
27
+ - Recall: 1.0
28
 
29
  ## Model description
30
 
 
43
  ### Training hyperparameters
44
 
45
  The following hyperparameters were used during training:
46
+ - learning_rate: 6e-06
47
  - train_batch_size: 32
48
  - eval_batch_size: 32
49
  - seed: 42
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
58
+ | 0.6832 | 1.0 | 4 | 0.6577 | 0.6786 | 0.7097 | 0.6111 | 0.8462 |
59
+ | 0.6332 | 2.0 | 8 | 0.6121 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
60
+ | 0.587 | 3.0 | 12 | 0.5636 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
61
+ | 0.5308 | 4.0 | 16 | 0.5053 | 0.8571 | 0.8462 | 0.8462 | 0.8462 |
62
+ | 0.4738 | 5.0 | 20 | 0.4425 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
63
+ | 0.3972 | 6.0 | 24 | 0.3848 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
64
+ | 0.3347 | 7.0 | 28 | 0.3371 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
65
+ | 0.2769 | 8.0 | 32 | 0.2950 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
66
+ | 0.2321 | 9.0 | 36 | 0.2621 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
67
+ | 0.1847 | 10.0 | 40 | 0.2343 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
68
+ | 0.1524 | 11.0 | 44 | 0.2120 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
69
+ | 0.1374 | 12.0 | 48 | 0.1935 | 0.8929 | 0.8800 | 0.9167 | 0.8462 |
70
+ | 0.1112 | 13.0 | 52 | 0.1792 | 0.9286 | 0.9231 | 0.9231 | 0.9231 |
71
+ | 0.0881 | 14.0 | 56 | 0.1687 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
72
+ | 0.0785 | 15.0 | 60 | 0.1623 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
73
+ | 0.065 | 16.0 | 64 | 0.1585 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
74
+ | 0.0625 | 17.0 | 68 | 0.1570 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
75
+ | 0.0566 | 18.0 | 72 | 0.1554 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
76
+ | 0.0587 | 19.0 | 76 | 0.1544 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
77
+ | 0.0537 | 20.0 | 80 | 0.1539 | 0.9643 | 0.9630 | 0.9286 | 1.0 |
78
 
79
 
80
  ### Framework versions
runs/Aug02_13-47-02_a0ae54856f2b/events.out.tfevents.1722606423.a0ae54856f2b.778.3 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ade8058587acdb28a6a4abc5a83006afe818736f8e7a74610eb6349084367fc1
3
- size 18597
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:207f53e2b7f302e1a9f6d8c5b4767fdf249a1b81b16cd1de966c200b886cff43
3
+ size 18945