Initial Commit
- README.md +15 -13
- eval_results_cardiff.json +1 -1
- pytorch_model.bin +1 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -23,10 +23,10 @@ model-index:
 metrics:
 - name: Accuracy
 type: accuracy
-value: 0.
+value: 0.6512345679012346
 - name: F1
 type: f1
-value: 0.
+value: 0.6483011417314103
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -36,9 +36,9 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on the tweet_sentiment_multilingual dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.
-- Accuracy: 0.
-- F1: 0.
+- Loss: 1.7268
+- Accuracy: 0.6512
+- F1: 0.6483
 
 ## Model description
 
@@ -58,8 +58,8 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size:
-- eval_batch_size:
+- train_batch_size: 32
+- eval_batch_size: 64
 - seed: 66
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -69,12 +69,14 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
+| 0.8937 | 1.09 | 500 | 0.8922 | 0.6304 | 0.6189 |
+| 0.6912 | 2.17 | 1000 | 0.8900 | 0.6551 | 0.6516 |
+| 0.527 | 3.26 | 1500 | 0.9088 | 0.6593 | 0.6583 |
+| 0.3874 | 4.35 | 2000 | 1.1089 | 0.6516 | 0.6470 |
+| 0.2977 | 5.43 | 2500 | 1.2137 | 0.6408 | 0.6433 |
+| 0.2397 | 6.52 | 3000 | 1.2022 | 0.6431 | 0.6409 |
+| 0.203 | 7.61 | 3500 | 1.4913 | 0.6454 | 0.6469 |
+| 0.1658 | 8.7 | 4000 | 1.7268 | 0.6512 | 0.6483 |
 
 
 ### Framework versions
eval_results_cardiff.json
CHANGED
@@ -1 +1 @@
-{"arabic": {"f1": 0.
+{"arabic": {"f1": 0.6442740029040547, "accuracy": 0.6402298850574712, "confusion_matrix": [[178, 86, 26], [45, 183, 62], [10, 84, 196]]}, "english": {"f1": 0.7057678593082327, "accuracy": 0.7068965517241379, "confusion_matrix": [[232, 48, 10], [85, 166, 39], [15, 58, 217]]}, "french": {"f1": 0.7057233249171375, "accuracy": 0.7057471264367816, "confusion_matrix": [[217, 47, 26], [32, 217, 41], [32, 78, 180]]}, "german": {"f1": 0.740952127201128, "accuracy": 0.7413793103448276, "confusion_matrix": [[221, 40, 29], [53, 198, 39], [24, 40, 226]]}, "hindi": {"f1": 0.48776920064887846, "accuracy": 0.5045977011494253, "confusion_matrix": [[231, 37, 22], [154, 92, 44], [133, 41, 116]]}, "italian": {"f1": 0.6882304946766388, "accuracy": 0.6873563218390805, "confusion_matrix": [[177, 73, 40], [24, 206, 60], [17, 58, 215]]}, "portuguese": {"f1": 0.7074958241100616, "accuracy": 0.7080459770114943, "confusion_matrix": [[199, 62, 29], [50, 186, 54], [15, 44, 231]]}, "spanish": {"f1": 0.6769025958446248, "accuracy": 0.6770114942528735, "confusion_matrix": [[196, 67, 27], [57, 172, 61], [14, 55, 221]]}, "all": {"f1": 0.6715940327292139, "accuracy": 0.6714080459770115, "confusion_matrix": [[1651, 460, 209], [500, 1420, 400], [260, 458, 1602]]}}
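The per-language `accuracy` and `f1` values in `eval_results_cardiff.json` can be cross-checked against the stored `confusion_matrix` entries. The sketch below is not part of the commit; the helper name `accuracy_and_macro_f1` is hypothetical, and it assumes the reported `f1` is an unweighted (macro) average over the three classes, which is consistent with the English entry but is not stated in the file.

```python
from typing import List, Tuple


def accuracy_and_macro_f1(cm: List[List[int]]) -> Tuple[float, float]:
    """Recompute accuracy and macro-averaged F1 from a square confusion matrix.

    Hypothetical helper for illustration. Per-class F1 is 2*TP / (row_sum + col_sum),
    which gives the same value whether rows hold true labels or predictions.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(n))
    per_class_f1 = []
    for i in range(n):
        row_sum = sum(cm[i])
        col_sum = sum(cm[r][i] for r in range(n))
        denom = row_sum + col_sum
        per_class_f1.append(2 * cm[i][i] / denom if denom else 0.0)
    return correct / total, sum(per_class_f1) / n


# "english" confusion matrix copied from eval_results_cardiff.json
english_cm = [[232, 48, 10], [85, 166, 39], [15, 58, 217]]
acc, f1 = accuracy_and_macro_f1(english_cm)
print(round(acc, 4), round(f1, 4))  # 0.7069 0.7058
```

Both numbers agree with the `english` entry (accuracy 0.7068965517241379, f1 0.7057678593082327), which supports the macro-averaging assumption for that language.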
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:f7bdf78b7eefcedd000ecab83910048819c362f4f73c044167bdecfd6d034872
 size 1115316658
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:0996a93393eb044aff203732e1aa3ee82242e1d9e5c91a3b2268d5895adf4ce7
 size 4600
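The two `.bin` entries are Git LFS pointer files: each records a `version`, an `oid` (a SHA-256 digest), and a `size` in bytes for the real binary. A minimal sketch of how a downloaded file might be checked against such a pointer follows. It is not part of the commit, and the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical.

```python
import hashlib
from typing import Dict


def parse_lfs_pointer(text: str) -> Dict[str, str]:
    """Split each 'key value' line of a Git LFS pointer file into a dict."""
    fields: Dict[str, str] = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(blob: bytes, pointer: Dict[str, str]) -> bool:
    """Return True if the blob's byte length and SHA-256 digest match the pointer."""
    algo, _, digest = pointer["oid"].partition(":")
    return (
        algo == "sha256"
        and len(blob) == int(pointer["size"])
        and hashlib.sha256(blob).hexdigest() == digest
    )


# Pointer contents for training_args.bin as they appear after this commit
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:0996a93393eb044aff203732e1aa3ee82242e1d9e5c91a3b2268d5895adf4ce7
size 4600
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["oid"], pointer["size"])
```

To verify a local copy, read the file in binary mode and pass its bytes to `matches_pointer` together with the parsed pointer. For a file as large as `pytorch_model.bin`, hashing in chunks would be preferable to loading it all at once.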