Initial Commit

Browse files

Files changed (5) hide show

README.md +51 -51
config.json +1 -1
eval_results_cardiff.json +1 -0
pytorch_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 license: mit
-base_model: microsoft/mdeberta-v3-base
 tags:
 - generated_from_trainer
 datasets:
@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
 # scenario-KD-PR-CDF-ALL-D2_data-cardiffnlp_tweet_sentiment_multilingual_all55
-This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on the tweet_sentiment_multilingual dataset.
 It achieves the following results on the evaluation set:
-- Loss: 7.1392
-- Accuracy: 0.5795
-- F1: 0.5799
 ## Model description
@@ -53,52 +53,52 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy | F1     |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|
-| 7.669         | 1.09  | 500   | 5.2297          | 0.5401   | 0.5412 |
-| 5.7891        | 2.17  | 1000  | 5.6794          | 0.5374   | 0.5287 |
-| 4.4155        | 3.26  | 1500  | 5.6114          | 0.5795   | 0.5803 |
-| 3.1781        | 4.35  | 2000  | 5.9508          | 0.5644   | 0.5647 |
-| 2.4652        | 5.43  | 2500  | 6.8498          | 0.5575   | 0.5541 |
-| 1.9293        | 6.52  | 3000  | 6.8127          | 0.5629   | 0.5586 |
-| 1.5873        | 7.61  | 3500  | 6.4357          | 0.5733   | 0.5730 |
-| 1.3713        | 8.7   | 4000  | 6.5502          | 0.5490   | 0.5512 |
-| 1.1665        | 9.78  | 4500  | 7.1228          | 0.5586   | 0.5603 |
-| 1.0433        | 10.87 | 5000  | 6.9788          | 0.5583   | 0.5599 |
-| 0.9457        | 11.96 | 5500  | 7.5000          | 0.5629   | 0.5611 |
-| 0.8346        | 13.04 | 6000  | 6.8302          | 0.5606   | 0.5624 |
-| 0.7455        | 14.13 | 6500  | 7.1573          | 0.5617   | 0.5624 |
-| 0.7037        | 15.22 | 7000  | 6.8806          | 0.5579   | 0.5583 |
-| 0.6155        | 16.3  | 7500  | 6.9611          | 0.5613   | 0.5596 |
-| 0.5969        | 17.39 | 8000  | 7.2010          | 0.5594   | 0.5598 |
-| 0.5308        | 18.48 | 8500  | 7.0794          | 0.5644   | 0.5631 |
-| 0.5012        | 19.57 | 9000  | 7.1497          | 0.5633   | 0.5641 |
-| 0.4625        | 20.65 | 9500  | 6.8539          | 0.5621   | 0.5636 |
-| 0.4561        | 21.74 | 10000 | 7.0273          | 0.5687   | 0.5680 |
-| 0.411         | 22.83 | 10500 | 7.3185          | 0.5687   | 0.5684 |
-| 0.3829        | 23.91 | 11000 | 7.1758          | 0.5714   | 0.5721 |
-| 0.3748        | 25.0  | 11500 | 7.5333          | 0.5656   | 0.5658 |
-| 0.3615        | 26.09 | 12000 | 7.2431          | 0.5602   | 0.5613 |
-| 0.3296        | 27.17 | 12500 | 7.4016          | 0.5617   | 0.5639 |
-| 0.3243        | 28.26 | 13000 | 7.1870          | 0.5745   | 0.5744 |
-| 0.31          | 29.35 | 13500 | 7.0699          | 0.5741   | 0.5747 |
-| 0.3029        | 30.43 | 14000 | 7.2708          | 0.5675   | 0.5676 |
-| 0.2944        | 31.52 | 14500 | 6.9788          | 0.5768   | 0.5766 |
-| 0.2842        | 32.61 | 15000 | 7.1500          | 0.5806   | 0.5818 |
-| 0.2798        | 33.7  | 15500 | 7.4410          | 0.5837   | 0.5845 |
-| 0.2661        | 34.78 | 16000 | 7.5007          | 0.5741   | 0.5753 |
-| 0.2651        | 35.87 | 16500 | 7.4349          | 0.5714   | 0.5724 |
-| 0.2621        | 36.96 | 17000 | 7.1030          | 0.5741   | 0.5747 |
-| 0.2502        | 38.04 | 17500 | 6.9999          | 0.5760   | 0.5760 |
-| 0.2456        | 39.13 | 18000 | 7.1837          | 0.5752   | 0.5757 |
-| 0.2464        | 40.22 | 18500 | 7.2256          | 0.5698   | 0.5709 |
-| 0.2361        | 41.3  | 19000 | 7.2977          | 0.5664   | 0.5671 |
-| 0.2349        | 42.39 | 19500 | 7.2691          | 0.5706   | 0.5713 |
-| 0.2332        | 43.48 | 20000 | 7.1631          | 0.5718   | 0.5726 |
-| 0.2319        | 44.57 | 20500 | 7.2903          | 0.5768   | 0.5769 |
-| 0.2312        | 45.65 | 21000 | 7.1751          | 0.5783   | 0.5788 |
-| 0.2234        | 46.74 | 21500 | 7.1176          | 0.5764   | 0.5770 |
-| 0.2281        | 47.83 | 22000 | 7.3225          | 0.5660   | 0.5662 |
-| 0.2224        | 48.91 | 22500 | 7.1554          | 0.5841   | 0.5849 |
-| 0.2282        | 50.0  | 23000 | 7.1392          | 0.5795   | 0.5799 |
 ### Framework versions

 ---
 license: mit
+base_model: haryoaw/scenario-MDBT-TCR_data-cardiffnlp_tweet_sentiment_multilingual_all
 tags:
 - generated_from_trainer
 datasets:
 # scenario-KD-PR-CDF-ALL-D2_data-cardiffnlp_tweet_sentiment_multilingual_all55
+This model is a fine-tuned version of [haryoaw/scenario-MDBT-TCR_data-cardiffnlp_tweet_sentiment_multilingual_all](https://huggingface.co/haryoaw/scenario-MDBT-TCR_data-cardiffnlp_tweet_sentiment_multilingual_all) on the tweet_sentiment_multilingual dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.2180
+- Accuracy: 0.5999
+- F1: 0.6008
 ## Model description
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy | F1     |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|
+| 1.2423        | 1.09  | 500   | 1.2214          | 0.4842   | 0.4591 |
+| 1.1465        | 2.17  | 1000  | 1.2081          | 0.5498   | 0.5406 |
+| 1.089         | 3.26  | 1500  | 1.2345          | 0.5540   | 0.5476 |
+| 1.043         | 4.35  | 2000  | 1.2340          | 0.5756   | 0.5777 |
+| 1.01          | 5.43  | 2500  | 1.2397          | 0.5706   | 0.5717 |
+| 0.9787        | 6.52  | 3000  | 1.2536          | 0.5718   | 0.5723 |
+| 0.9656        | 7.61  | 3500  | 1.2564          | 0.5579   | 0.5603 |
+| 0.9505        | 8.7   | 4000  | 1.2641          | 0.5644   | 0.5660 |
+| 0.9432        | 9.78  | 4500  | 1.2385          | 0.5880   | 0.5876 |
+| 0.9304        | 10.87 | 5000  | 1.2612          | 0.5864   | 0.5862 |
+| 0.9245        | 11.96 | 5500  | 1.2567          | 0.5748   | 0.5728 |
+| 0.9189        | 13.04 | 6000  | 1.2463          | 0.5745   | 0.5745 |
+| 0.9131        | 14.13 | 6500  | 1.2599          | 0.5729   | 0.5738 |
+| 0.9098        | 15.22 | 7000  | 1.2614          | 0.5706   | 0.5704 |
+| 0.9052        | 16.3  | 7500  | 1.2468          | 0.5741   | 0.5748 |
+| 0.9013        | 17.39 | 8000  | 1.2550          | 0.5756   | 0.5775 |
+| 0.8972        | 18.48 | 8500  | 1.2661          | 0.5733   | 0.5743 |
+| 0.8972        | 19.57 | 9000  | 1.2506          | 0.5783   | 0.5780 |
+| 0.8912        | 20.65 | 9500  | 1.2519          | 0.5737   | 0.5752 |
+| 0.8903        | 21.74 | 10000 | 1.2313          | 0.5795   | 0.5782 |
+| 0.8868        | 22.83 | 10500 | 1.2384          | 0.5895   | 0.5896 |
+| 0.8847        | 23.91 | 11000 | 1.2474          | 0.5752   | 0.5736 |
+| 0.8834        | 25.0  | 11500 | 1.2458          | 0.5791   | 0.5795 |
+| 0.8815        | 26.09 | 12000 | 1.2548          | 0.5748   | 0.5739 |
+| 0.8794        | 27.17 | 12500 | 1.2378          | 0.5864   | 0.5857 |
+| 0.8791        | 28.26 | 13000 | 1.2327          | 0.5968   | 0.5953 |
+| 0.8749        | 29.35 | 13500 | 1.2249          | 0.5949   | 0.5935 |
+| 0.8748        | 30.43 | 14000 | 1.2309          | 0.5938   | 0.5905 |
+| 0.8734        | 31.52 | 14500 | 1.2242          | 0.5880   | 0.5885 |
+| 0.872         | 32.61 | 15000 | 1.2372          | 0.5841   | 0.5856 |
+| 0.8712        | 33.7  | 15500 | 1.2394          | 0.5783   | 0.5800 |
+| 0.87          | 34.78 | 16000 | 1.2363          | 0.5922   | 0.5921 |
+| 0.8692        | 35.87 | 16500 | 1.2375          | 0.5903   | 0.5916 |
+| 0.8677        | 36.96 | 17000 | 1.2341          | 0.5968   | 0.5951 |
+| 0.8672        | 38.04 | 17500 | 1.2227          | 0.6038   | 0.6013 |
+| 0.8657        | 39.13 | 18000 | 1.2250          | 0.5899   | 0.5904 |
+| 0.865         | 40.22 | 18500 | 1.2275          | 0.5949   | 0.5952 |
+| 0.865         | 41.3  | 19000 | 1.2196          | 0.5953   | 0.5958 |
+| 0.864         | 42.39 | 19500 | 1.2375          | 0.5818   | 0.5815 |
+| 0.8636        | 43.48 | 20000 | 1.2373          | 0.5849   | 0.5856 |
+| 0.8635        | 44.57 | 20500 | 1.2292          | 0.5930   | 0.5940 |
+| 0.8622        | 45.65 | 21000 | 1.2243          | 0.5903   | 0.5914 |
+| 0.8619        | 46.74 | 21500 | 1.2198          | 0.5984   | 0.5992 |
+| 0.8608        | 47.83 | 22000 | 1.2175          | 0.6046   | 0.6054 |
+| 0.8621        | 48.91 | 22500 | 1.2179          | 0.5995   | 0.6004 |
+| 0.8606        | 50.0  | 23000 | 1.2180          | 0.5999   | 0.6008 |
 ### Framework versions

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "microsoft/mdeberta-v3-base",
   "architectures": [
     "DebertaForSequenceClassificationKD"
   ],

 {
+  "_name_or_path": "haryoaw/scenario-MDBT-TCR_data-cardiffnlp_tweet_sentiment_multilingual_all",
   "architectures": [
     "DebertaForSequenceClassificationKD"
   ],

eval_results_cardiff.json ADDED Viewed

	@@ -0,0 +1 @@

+ {"arabic": {"f1": 0.5612630228065163, "accuracy": 0.5586206896551724, "confusion_matrix": [[155, 101, 34], [57, 177, 56], [33, 103, 154]]}, "english": {"f1": 0.6449384250034303, "accuracy": 0.6459770114942529, "confusion_matrix": [[225, 50, 15], [83, 171, 36], [41, 83, 166]]}, "french": {"f1": 0.6468819435660574, "accuracy": 0.6471264367816092, "confusion_matrix": [[198, 47, 45], [32, 201, 57], [43, 83, 164]]}, "german": {"f1": 0.6930782783320829, "accuracy": 0.6931034482758621, "confusion_matrix": [[201, 51, 38], [48, 214, 28], [39, 63, 188]]}, "hindi": {"f1": 0.45263678447417727, "accuracy": 0.4528735632183908, "confusion_matrix": [[140, 85, 65], [78, 129, 83], [88, 77, 125]]}, "italian": {"f1": 0.5675275816402665, "accuracy": 0.5712643678160919, "confusion_matrix": [[147, 84, 59], [20, 212, 58], [50, 102, 138]]}, "portuguese": {"f1": 0.6265368086078835, "accuracy": 0.6241379310344828, "confusion_matrix": [[187, 82, 21], [66, 173, 51], [34, 73, 183]]}, "spanish": {"f1": 0.5742844725965642, "accuracy": 0.5735632183908046, "confusion_matrix": [[179, 73, 38], [71, 153, 66], [43, 80, 167]]}, "all": {"f1": 0.5959713022267769, "accuracy": 0.5952586206896552, "confusion_matrix": [[1430, 577, 313], [481, 1409, 430], [368, 648, 1304]]}}

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fff4bdbd1414848f100a88a9dfeff9199e592bccb400f5cbec8c9164e50bbed5
 size 946740394

 version https://git-lfs.github.com/spec/v1
+oid sha256:9410f1db316c62c25aade5cce7321dfa98ca92f1cae2a4049b5d2ca901f1eb1f
 size 946740394

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:141e95e5ec0bb44612c6efb8f7543206830658c0e94d7805c72b6e846a4b7a3f
 size 4664

 version https://git-lfs.github.com/spec/v1
+oid sha256:631f5fe9fb15addad2d2a7dc0554e507cc3f01a591847e41314c5f5747b45133
 size 4664