g-assismoraes commited on
Commit
a8d2b61
1 Parent(s): f5d6376

End of training

Browse files
README.md ADDED
@@ -0,0 +1,79 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: mit
4
+ base_model: microsoft/deberta-v3-base
5
+ tags:
6
+ - generated_from_trainer
7
+ model-index:
8
+ - name: deberta-semeval25_EN08_CC_fold4
9
+ results: []
10
+ ---
11
+
12
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
+ should probably proofread and complete it, then remove this comment. -->
14
+
15
+ # deberta-semeval25_EN08_CC_fold4
16
+
17
+ This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on the None dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 9.2901
20
+ - Precision Samples: 0.2580
21
+ - Recall Samples: 0.6139
22
+ - F1 Samples: 0.3025
23
+ - Precision Macro: 0.8246
24
+ - Recall Macro: 0.3675
25
+ - F1 Macro: 0.2397
26
+ - Precision Micro: 0.2267
27
+ - Recall Micro: 0.4535
28
+ - F1 Micro: 0.3023
29
+ - Precision Weighted: 0.6457
30
+ - Recall Weighted: 0.4535
31
+ - F1 Weighted: 0.2035
32
+
33
+ ## Model description
34
+
35
+ More information needed
36
+
37
+ ## Intended uses & limitations
38
+
39
+ More information needed
40
+
41
+ ## Training and evaluation data
42
+
43
+ More information needed
44
+
45
+ ## Training procedure
46
+
47
+ ### Training hyperparameters
48
+
49
+ The following hyperparameters were used during training:
50
+ - learning_rate: 2e-05
51
+ - train_batch_size: 8
52
+ - eval_batch_size: 8
53
+ - seed: 42
54
+ - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
55
+ - lr_scheduler_type: linear
56
+ - num_epochs: 10
57
+
58
+ ### Training results
59
+
60
+ | Training Loss | Epoch | Step | Validation Loss | Precision Samples | Recall Samples | F1 Samples | Precision Macro | Recall Macro | F1 Macro | Precision Micro | Recall Micro | F1 Micro | Precision Weighted | Recall Weighted | F1 Weighted |
61
+ |:-------------:|:-----:|:----:|:---------------:|:-----------------:|:--------------:|:----------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|:--------:|:------------------:|:---------------:|:-----------:|
62
+ | 8.5826 | 1.0 | 15 | 10.3420 | 1.0 | 0.0 | 0.0 | 1.0 | 0.1951 | 0.1951 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 |
63
+ | 8.6265 | 2.0 | 30 | 10.0401 | 0.2222 | 0.3833 | 0.2629 | 0.9437 | 0.2622 | 0.2216 | 0.2317 | 0.2209 | 0.2262 | 0.8171 | 0.2209 | 0.0948 |
64
+ | 7.4706 | 3.0 | 45 | 9.8509 | 0.2167 | 0.3278 | 0.1891 | 0.9180 | 0.2530 | 0.2189 | 0.1905 | 0.1860 | 0.1882 | 0.7653 | 0.1860 | 0.0844 |
65
+ | 7.3649 | 4.0 | 60 | 9.7126 | 0.3056 | 0.4333 | 0.2617 | 0.9250 | 0.2896 | 0.2308 | 0.2330 | 0.2791 | 0.2540 | 0.7841 | 0.2791 | 0.1163 |
66
+ | 6.8711 | 5.0 | 75 | 9.5275 | 0.4300 | 0.4222 | 0.2444 | 0.9056 | 0.2866 | 0.2368 | 0.25 | 0.2674 | 0.2584 | 0.6815 | 0.2674 | 0.1312 |
67
+ | 7.2056 | 6.0 | 90 | 9.4326 | 0.2260 | 0.4778 | 0.2542 | 0.8618 | 0.3154 | 0.2447 | 0.2090 | 0.3256 | 0.2545 | 0.6282 | 0.3256 | 0.1408 |
68
+ | 7.7788 | 7.0 | 105 | 9.6122 | 0.2333 | 0.5278 | 0.2710 | 0.8411 | 0.3407 | 0.2519 | 0.2162 | 0.3721 | 0.2735 | 0.6179 | 0.3721 | 0.1617 |
69
+ | 7.2763 | 8.0 | 120 | 9.3189 | 0.2293 | 0.5139 | 0.2608 | 0.8449 | 0.3382 | 0.2558 | 0.2138 | 0.3605 | 0.2684 | 0.6243 | 0.3605 | 0.1670 |
70
+ | 7.3285 | 9.0 | 135 | 9.2802 | 0.2552 | 0.6083 | 0.3031 | 0.8249 | 0.3650 | 0.2397 | 0.2331 | 0.4419 | 0.3052 | 0.6451 | 0.4419 | 0.2011 |
71
+ | 7.7699 | 10.0 | 150 | 9.2901 | 0.2580 | 0.6139 | 0.3025 | 0.8246 | 0.3675 | 0.2397 | 0.2267 | 0.4535 | 0.3023 | 0.6457 | 0.4535 | 0.2035 |
72
+
73
+
74
+ ### Framework versions
75
+
76
+ - Transformers 4.46.0
77
+ - Pytorch 2.3.1
78
+ - Datasets 2.21.0
79
+ - Tokenizers 0.20.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:43c865be381819b03006046c19a68646d868060bb53664eececd1acb0d521797
3
  size 737839252
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d548d2a22d5d77037969111a61469686b03f204b9f935e599ced8b1c40f6ace1
3
  size 737839252
runs/Oct28_12-55-56_icuff-Z790-UD/events.out.tfevents.1730130957.icuff-Z790-UD.1041902.6 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f1f0b7238887a3fac6c2e99130a446a6bc289185af4a5bc19d7c8aefc2122482
3
- size 13062
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c48fd14ea77e4c619e78ff791dc92d349f70bff0ffe348c2bc5affcb077b5f53
3
+ size 19803