Initial Commit
Browse files- README.md +113 -113
- eval_results_ml.json +1 -1
- pytorch_model.bin +1 -1
- training_args.bin +2 -2
README.md
CHANGED
@@ -23,10 +23,10 @@ model-index:
|
|
23 |
metrics:
|
24 |
- name: Accuracy
|
25 |
type: accuracy
|
26 |
-
value: 0.
|
27 |
- name: F1
|
28 |
type: f1
|
29 |
-
value: 0.
|
30 |
---
|
31 |
|
32 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -36,9 +36,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
36 |
|
37 |
This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on the massive dataset.
|
38 |
It achieves the following results on the evaluation set:
|
39 |
-
- Loss: 2.
|
40 |
-
- Accuracy: 0.
|
41 |
-
- F1: 0.
|
42 |
|
43 |
## Model description
|
44 |
|
@@ -69,114 +69,114 @@ The following hyperparameters were used during training:
|
|
69 |
|
70 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
|
71 |
|:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|
|
72 |
-
| No log | 0.28 | 100 |
|
73 |
-
| No log | 0.56 | 200 | 2.
|
74 |
-
| No log | 0.83 | 300 | 1.
|
75 |
-
| No log | 1.11 | 400 | 1.
|
76 |
-
| 1.
|
77 |
-
| 1.
|
78 |
-
| 1.
|
79 |
-
| 1.
|
80 |
-
| 1.
|
81 |
-
| 0.
|
82 |
-
| 0.
|
83 |
-
| 0.
|
84 |
-
| 0.
|
85 |
-
| 0.
|
86 |
-
| 0.
|
87 |
-
| 0.
|
88 |
-
| 0.
|
89 |
-
| 0.
|
90 |
-
| 0.
|
91 |
-
| 0.
|
92 |
-
| 0.
|
93 |
-
| 0.
|
94 |
-
| 0.
|
95 |
-
| 0.
|
96 |
-
| 0.
|
97 |
-
| 0.
|
98 |
-
| 0.
|
99 |
-
| 0.
|
100 |
-
| 0.
|
101 |
-
| 0.
|
102 |
-
| 0.
|
103 |
-
| 0.
|
104 |
-
| 0.
|
105 |
-
| 0.
|
106 |
-
| 0.
|
107 |
-
| 0.
|
108 |
-
| 0.
|
109 |
-
| 0.
|
110 |
-
| 0.
|
111 |
-
| 0.
|
112 |
-
| 0.
|
113 |
-
| 0.
|
114 |
-
| 0.
|
115 |
-
| 0.
|
116 |
-
| 0.
|
117 |
-
| 0.
|
118 |
-
| 0.
|
119 |
-
| 0.
|
120 |
-
| 0.
|
121 |
-
| 0.
|
122 |
-
| 0.
|
123 |
-
| 0.
|
124 |
-
| 0.
|
125 |
-
| 0.
|
126 |
-
| 0.
|
127 |
-
| 0.
|
128 |
-
| 0.
|
129 |
-
| 0.
|
130 |
-
| 0.
|
131 |
-
| 0.
|
132 |
-
| 0.
|
133 |
-
| 0.
|
134 |
-
| 0.
|
135 |
-
| 0.
|
136 |
-
| 0.
|
137 |
-
| 0.
|
138 |
-
| 0.
|
139 |
-
| 0.
|
140 |
-
| 0.
|
141 |
-
| 0.
|
142 |
-
| 0.
|
143 |
-
| 0.
|
144 |
-
| 0.
|
145 |
-
| 0.
|
146 |
-
| 0.
|
147 |
-
| 0.
|
148 |
-
| 0.
|
149 |
-
| 0.
|
150 |
-
| 0.
|
151 |
-
| 0.
|
152 |
-
| 0.
|
153 |
-
| 0.
|
154 |
-
| 0.
|
155 |
-
| 0.
|
156 |
-
| 0.
|
157 |
-
| 0.
|
158 |
-
| 0.
|
159 |
-
| 0.
|
160 |
-
| 0.
|
161 |
-
| 0.
|
162 |
-
| 0.
|
163 |
-
| 0.
|
164 |
-
| 0.
|
165 |
-
| 0.
|
166 |
-
| 0.
|
167 |
-
| 0.
|
168 |
-
| 0.
|
169 |
-
| 0.
|
170 |
-
| 0.
|
171 |
-
| 0.
|
172 |
-
| 0.
|
173 |
-
| 0.
|
174 |
-
| 0.
|
175 |
-
| 0.
|
176 |
-
| 0.
|
177 |
-
| 0.
|
178 |
-
| 0.
|
179 |
-
| 0.
|
180 |
|
181 |
|
182 |
### Framework versions
|
|
|
23 |
metrics:
|
24 |
- name: Accuracy
|
25 |
type: accuracy
|
26 |
+
value: 0.7360994569987366
|
27 |
- name: F1
|
28 |
type: f1
|
29 |
+
value: 0.688120673898054
|
30 |
---
|
31 |
|
32 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
36 |
|
37 |
This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on the massive dataset.
|
38 |
It achieves the following results on the evaluation set:
|
39 |
+
- Loss: 2.5243
|
40 |
+
- Accuracy: 0.7361
|
41 |
+
- F1: 0.6881
|
42 |
|
43 |
## Model description
|
44 |
|
|
|
69 |
|
70 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
|
71 |
|:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|
|
72 |
+
| No log | 0.28 | 100 | 2.9524 | 0.3055 | 0.0759 |
|
73 |
+
| No log | 0.56 | 200 | 2.0666 | 0.4969 | 0.2470 |
|
74 |
+
| No log | 0.83 | 300 | 1.6852 | 0.5878 | 0.3626 |
|
75 |
+
| No log | 1.11 | 400 | 1.4751 | 0.6376 | 0.4564 |
|
76 |
+
| 1.9292 | 1.39 | 500 | 1.4633 | 0.6522 | 0.4887 |
|
77 |
+
| 1.9292 | 1.67 | 600 | 1.4477 | 0.6604 | 0.5016 |
|
78 |
+
| 1.9292 | 1.94 | 700 | 1.3844 | 0.6758 | 0.5746 |
|
79 |
+
| 1.9292 | 2.22 | 800 | 1.3272 | 0.6937 | 0.5857 |
|
80 |
+
| 1.9292 | 2.5 | 900 | 1.2445 | 0.7178 | 0.6192 |
|
81 |
+
| 0.6258 | 2.78 | 1000 | 1.3348 | 0.7128 | 0.6198 |
|
82 |
+
| 0.6258 | 3.06 | 1100 | 1.5354 | 0.6734 | 0.6006 |
|
83 |
+
| 0.6258 | 3.33 | 1200 | 1.4149 | 0.7001 | 0.6258 |
|
84 |
+
| 0.6258 | 3.61 | 1300 | 1.4474 | 0.7032 | 0.6405 |
|
85 |
+
| 0.6258 | 3.89 | 1400 | 1.5031 | 0.7054 | 0.6395 |
|
86 |
+
| 0.3592 | 4.17 | 1500 | 1.3733 | 0.7225 | 0.6669 |
|
87 |
+
| 0.3592 | 4.44 | 1600 | 1.3757 | 0.7317 | 0.6555 |
|
88 |
+
| 0.3592 | 4.72 | 1700 | 1.4694 | 0.7134 | 0.6539 |
|
89 |
+
| 0.3592 | 5.0 | 1800 | 1.4733 | 0.7077 | 0.6461 |
|
90 |
+
| 0.3592 | 5.28 | 1900 | 1.5077 | 0.7232 | 0.6654 |
|
91 |
+
| 0.2316 | 5.56 | 2000 | 1.6396 | 0.7069 | 0.6482 |
|
92 |
+
| 0.2316 | 5.83 | 2100 | 1.5588 | 0.7178 | 0.6599 |
|
93 |
+
| 0.2316 | 6.11 | 2200 | 1.5611 | 0.7206 | 0.6507 |
|
94 |
+
| 0.2316 | 6.39 | 2300 | 1.7029 | 0.7155 | 0.6597 |
|
95 |
+
| 0.2316 | 6.67 | 2400 | 1.7865 | 0.7048 | 0.6407 |
|
96 |
+
| 0.1763 | 6.94 | 2500 | 1.7791 | 0.7065 | 0.6555 |
|
97 |
+
| 0.1763 | 7.22 | 2600 | 1.8013 | 0.7176 | 0.6572 |
|
98 |
+
| 0.1763 | 7.5 | 2700 | 1.8034 | 0.7149 | 0.6578 |
|
99 |
+
| 0.1763 | 7.78 | 2800 | 2.0082 | 0.6872 | 0.6234 |
|
100 |
+
| 0.1763 | 8.06 | 2900 | 2.0108 | 0.7011 | 0.6388 |
|
101 |
+
| 0.1136 | 8.33 | 3000 | 2.0779 | 0.6997 | 0.6513 |
|
102 |
+
| 0.1136 | 8.61 | 3100 | 1.9805 | 0.7122 | 0.6555 |
|
103 |
+
| 0.1136 | 8.89 | 3200 | 2.1014 | 0.7010 | 0.6485 |
|
104 |
+
| 0.1136 | 9.17 | 3300 | 1.9710 | 0.7133 | 0.6537 |
|
105 |
+
| 0.1136 | 9.44 | 3400 | 1.9677 | 0.7152 | 0.6564 |
|
106 |
+
| 0.0964 | 9.72 | 3500 | 2.0902 | 0.7079 | 0.6535 |
|
107 |
+
| 0.0964 | 10.0 | 3600 | 2.0776 | 0.7083 | 0.6529 |
|
108 |
+
| 0.0964 | 10.28 | 3700 | 2.0649 | 0.7191 | 0.6647 |
|
109 |
+
| 0.0964 | 10.56 | 3800 | 2.0690 | 0.7152 | 0.6551 |
|
110 |
+
| 0.0964 | 10.83 | 3900 | 2.1585 | 0.7055 | 0.6513 |
|
111 |
+
| 0.0721 | 11.11 | 4000 | 2.0158 | 0.7236 | 0.6660 |
|
112 |
+
| 0.0721 | 11.39 | 4100 | 2.1559 | 0.7120 | 0.6616 |
|
113 |
+
| 0.0721 | 11.67 | 4200 | 2.0517 | 0.7253 | 0.6694 |
|
114 |
+
| 0.0721 | 11.94 | 4300 | 2.1721 | 0.7219 | 0.6662 |
|
115 |
+
| 0.0721 | 12.22 | 4400 | 2.2949 | 0.7079 | 0.6680 |
|
116 |
+
| 0.0448 | 12.5 | 4500 | 2.1676 | 0.7186 | 0.6685 |
|
117 |
+
| 0.0448 | 12.78 | 4600 | 2.0882 | 0.7227 | 0.6636 |
|
118 |
+
| 0.0448 | 13.06 | 4700 | 2.0149 | 0.7335 | 0.6736 |
|
119 |
+
| 0.0448 | 13.33 | 4800 | 2.2128 | 0.7243 | 0.6667 |
|
120 |
+
| 0.0448 | 13.61 | 4900 | 2.2664 | 0.7200 | 0.6577 |
|
121 |
+
| 0.0371 | 13.89 | 5000 | 2.3489 | 0.7100 | 0.6656 |
|
122 |
+
| 0.0371 | 14.17 | 5100 | 2.3454 | 0.7087 | 0.6531 |
|
123 |
+
| 0.0371 | 14.44 | 5200 | 2.2062 | 0.7296 | 0.6767 |
|
124 |
+
| 0.0371 | 14.72 | 5300 | 2.4544 | 0.7101 | 0.6661 |
|
125 |
+
| 0.0371 | 15.0 | 5400 | 2.2581 | 0.7275 | 0.6683 |
|
126 |
+
| 0.0227 | 15.28 | 5500 | 2.2904 | 0.7242 | 0.6697 |
|
127 |
+
| 0.0227 | 15.56 | 5600 | 2.3484 | 0.7152 | 0.6495 |
|
128 |
+
| 0.0227 | 15.83 | 5700 | 2.4505 | 0.7126 | 0.6599 |
|
129 |
+
| 0.0227 | 16.11 | 5800 | 2.2985 | 0.7236 | 0.6673 |
|
130 |
+
| 0.0227 | 16.39 | 5900 | 2.3929 | 0.7245 | 0.6751 |
|
131 |
+
| 0.022 | 16.67 | 6000 | 2.4606 | 0.7200 | 0.6643 |
|
132 |
+
| 0.022 | 16.94 | 6100 | 2.3481 | 0.7276 | 0.6689 |
|
133 |
+
| 0.022 | 17.22 | 6200 | 2.3302 | 0.7273 | 0.6724 |
|
134 |
+
| 0.022 | 17.5 | 6300 | 2.3566 | 0.7292 | 0.6787 |
|
135 |
+
| 0.022 | 17.78 | 6400 | 2.3972 | 0.7281 | 0.6785 |
|
136 |
+
| 0.0133 | 18.06 | 6500 | 2.5105 | 0.7205 | 0.6705 |
|
137 |
+
| 0.0133 | 18.33 | 6600 | 2.3785 | 0.7295 | 0.6775 |
|
138 |
+
| 0.0133 | 18.61 | 6700 | 2.4367 | 0.7220 | 0.6676 |
|
139 |
+
| 0.0133 | 18.89 | 6800 | 2.4496 | 0.7255 | 0.6690 |
|
140 |
+
| 0.0133 | 19.17 | 6900 | 2.4133 | 0.7279 | 0.6720 |
|
141 |
+
| 0.0097 | 19.44 | 7000 | 2.5588 | 0.7140 | 0.6652 |
|
142 |
+
| 0.0097 | 19.72 | 7100 | 2.4906 | 0.7210 | 0.6656 |
|
143 |
+
| 0.0097 | 20.0 | 7200 | 2.5187 | 0.7199 | 0.6619 |
|
144 |
+
| 0.0097 | 20.28 | 7300 | 2.4627 | 0.7254 | 0.6686 |
|
145 |
+
| 0.0097 | 20.56 | 7400 | 2.5543 | 0.7187 | 0.6615 |
|
146 |
+
| 0.0096 | 20.83 | 7500 | 2.4262 | 0.7259 | 0.6676 |
|
147 |
+
| 0.0096 | 21.11 | 7600 | 2.4768 | 0.7256 | 0.6699 |
|
148 |
+
| 0.0096 | 21.39 | 7700 | 2.5336 | 0.7220 | 0.6724 |
|
149 |
+
| 0.0096 | 21.67 | 7800 | 2.5221 | 0.7240 | 0.6703 |
|
150 |
+
| 0.0096 | 21.94 | 7900 | 2.5008 | 0.7269 | 0.6712 |
|
151 |
+
| 0.0086 | 22.22 | 8000 | 2.4998 | 0.7278 | 0.6703 |
|
152 |
+
| 0.0086 | 22.5 | 8100 | 2.4611 | 0.7319 | 0.6842 |
|
153 |
+
| 0.0086 | 22.78 | 8200 | 2.5119 | 0.7313 | 0.6832 |
|
154 |
+
| 0.0086 | 23.06 | 8300 | 2.4329 | 0.7300 | 0.6764 |
|
155 |
+
| 0.0086 | 23.33 | 8400 | 2.4080 | 0.7317 | 0.6822 |
|
156 |
+
| 0.007 | 23.61 | 8500 | 2.4054 | 0.7313 | 0.6802 |
|
157 |
+
| 0.007 | 23.89 | 8600 | 2.4345 | 0.7334 | 0.6851 |
|
158 |
+
| 0.007 | 24.17 | 8700 | 2.4735 | 0.7326 | 0.6865 |
|
159 |
+
| 0.007 | 24.44 | 8800 | 2.4718 | 0.7313 | 0.6843 |
|
160 |
+
| 0.007 | 24.72 | 8900 | 2.4391 | 0.7328 | 0.6818 |
|
161 |
+
| 0.0029 | 25.0 | 9000 | 2.5152 | 0.7290 | 0.6869 |
|
162 |
+
| 0.0029 | 25.28 | 9100 | 2.4609 | 0.7365 | 0.6908 |
|
163 |
+
| 0.0029 | 25.56 | 9200 | 2.4717 | 0.7359 | 0.6932 |
|
164 |
+
| 0.0029 | 25.83 | 9300 | 2.5283 | 0.7337 | 0.6881 |
|
165 |
+
| 0.0029 | 26.11 | 9400 | 2.4831 | 0.7342 | 0.6866 |
|
166 |
+
| 0.0026 | 26.39 | 9500 | 2.5291 | 0.7325 | 0.6861 |
|
167 |
+
| 0.0026 | 26.67 | 9600 | 2.5201 | 0.7344 | 0.6855 |
|
168 |
+
| 0.0026 | 26.94 | 9700 | 2.5496 | 0.7322 | 0.6857 |
|
169 |
+
| 0.0026 | 27.22 | 9800 | 2.5302 | 0.7332 | 0.6853 |
|
170 |
+
| 0.0026 | 27.5 | 9900 | 2.5388 | 0.7329 | 0.6871 |
|
171 |
+
| 0.0025 | 27.78 | 10000 | 2.5210 | 0.7326 | 0.6845 |
|
172 |
+
| 0.0025 | 28.06 | 10100 | 2.5482 | 0.7319 | 0.6841 |
|
173 |
+
| 0.0025 | 28.33 | 10200 | 2.5628 | 0.7315 | 0.6853 |
|
174 |
+
| 0.0025 | 28.61 | 10300 | 2.5439 | 0.7341 | 0.6870 |
|
175 |
+
| 0.0025 | 28.89 | 10400 | 2.5241 | 0.7356 | 0.6875 |
|
176 |
+
| 0.001 | 29.17 | 10500 | 2.5238 | 0.7354 | 0.6873 |
|
177 |
+
| 0.001 | 29.44 | 10600 | 2.5186 | 0.7362 | 0.6880 |
|
178 |
+
| 0.001 | 29.72 | 10700 | 2.5237 | 0.7360 | 0.6880 |
|
179 |
+
| 0.001 | 30.0 | 10800 | 2.5243 | 0.7361 | 0.6881 |
|
180 |
|
181 |
|
182 |
### Framework versions
|
eval_results_ml.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"
|
|
|
1 |
+
{"zh-CN": {"f1": 0.7634646910156971, "accuracy": 0.808338937457969}, "af-ZA": {"f1": 0.7132538836455592, "accuracy": 0.7726967047747142}, "sl-SL": {"f1": 0.6815919599392998, "accuracy": 0.7283120376597175}, "jv-ID": {"f1": 0.5042661262944395, "accuracy": 0.5622057834566241}, "ms-MY": {"f1": 0.7076965391417502, "accuracy": 0.7814391392064559}, "en-US": {"f1": 0.8492541138839934, "accuracy": 0.8850033624747814}, "pl-PL": {"f1": 0.7658905188800398, "accuracy": 0.8332212508406187}, "pt-PT": {"f1": 0.7748214581945277, "accuracy": 0.8187626092804304}, "sq-AL": {"f1": 0.6512385364922972, "accuracy": 0.7293207800941492}, "ar-SA": {"f1": 0.5950037905546635, "accuracy": 0.66408876933423}, "nl-NL": {"f1": 0.7778812586092927, "accuracy": 0.8389374579690653}, "nb-NO": {"f1": 0.7791878747868659, "accuracy": 0.8231338264963013}, "hi-IN": {"f1": 0.6619529753957635, "accuracy": 0.7380632145258911}, "am-ET": {"f1": 0.39801750136825015, "accuracy": 0.4751176866173504}, "hy-AM": {"f1": 0.6524514425641766, "accuracy": 0.7094821788836584}, "es-ES": {"f1": 0.7855956507266095, "accuracy": 0.8197713517148622}, "mn-MN": {"f1": 0.5616803093933431, "accuracy": 0.6469401479488904}, "my-MM": {"f1": 0.6284189841636294, "accuracy": 0.6987222595830531}, "id-ID": {"f1": 0.7667589389394739, "accuracy": 0.8248150638870209}, "bn-BD": {"f1": 0.5976275362376169, "accuracy": 0.6849361129791527}, "ml-IN": {"f1": 0.6599482579585687, "accuracy": 0.7256220578345662}, "kn-IN": {"f1": 0.6064156821788045, "accuracy": 0.6684599865501009}, "th-TH": {"f1": 0.7256521121455951, "accuracy": 0.7642905178211163}, "te-IN": {"f1": 0.5833162050913214, "accuracy": 0.668123739071957}, "da-DK": {"f1": 0.7583186176075081, "accuracy": 0.8227975790181573}, "ko-KR": {"f1": 0.6690864516544772, "accuracy": 0.7175521183591123}, "de-DE": {"f1": 0.7950178636361679, "accuracy": 0.8453261600537996}, "vi-VN": {"f1": 0.6943169086726416, "accuracy": 0.7525218560860794}, "ca-ES": {"f1": 0.7044845715136698, "accuracy": 0.7542030934767989}, "lv-LV": {"f1": 0.6759393003776176, "accuracy": 0.715198386012105}, "km-KH": {"f1": 0.5999230459937922, "accuracy": 0.660053799596503}, "ur-PK": {"f1": 0.5808396212998461, "accuracy": 0.6455951580363147}, "ro-RO": {"f1": 0.7441666946584294, "accuracy": 0.7911903160726295}, "fa-IR": {"f1": 0.7366192685998338, "accuracy": 0.7911903160726295}, "fi-FI": {"f1": 0.6733547663279966, "accuracy": 0.7427706792199058}, "tr-TR": {"f1": 0.729186622598369, "accuracy": 0.7955615332885003}, "az-AZ": {"f1": 0.6569521420870025, "accuracy": 0.7158708809683927}, "ja-JP": {"f1": 0.7684295783831104, "accuracy": 0.8113651647612643}, "sv-SE": {"f1": 0.782019513774951, "accuracy": 0.8365837256220578}, "cy-GB": {"f1": 0.31239075783660997, "accuracy": 0.417955615332885}, "ta-IN": {"f1": 0.6494363690911867, "accuracy": 0.7098184263618023}, "he-IL": {"f1": 0.6839618968239004, "accuracy": 0.7552118359112306}, "it-IT": {"f1": 0.751320651617522, "accuracy": 0.8100201748486886}, "ka-GE": {"f1": 0.5926929810384952, "accuracy": 0.640551445864156}, "ru-RU": {"f1": 0.7615343987482663, "accuracy": 0.8032952252858103}, "el-GR": {"f1": 0.7183647189599724, "accuracy": 0.7760591795561533}, "hu-HU": {"f1": 0.707958439982561, "accuracy": 0.7794216543375925}, "fr-FR": {"f1": 0.779806731815328, "accuracy": 0.8275050437121722}, "is-IS": {"f1": 0.5763500866282868, "accuracy": 0.6691324815063887}, "tl-PH": {"f1": 0.5531019350487393, "accuracy": 0.648285137861466}, "sw-KE": {"f1": 0.5448600851818804, "accuracy": 0.6193678547410895}, "zh-TW": {"f1": 0.7582629150324649, "accuracy": 0.7790854068594486}, "all": {"f1": 0.6851210924217023, "accuracy": 0.7366018312554964}}
|
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 1115491954
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f5668eb9820da86cdde746b80dd4b6efcd3be5d8d44eff56f0bf84fe6688faa3
|
3 |
size 1115491954
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b3e32e48b2407cc75242b20a8317df8576dc5126028283d1c681247efe33ce91
|
3 |
+
size 4536
|