Update README.md
Browse files
README.md
CHANGED
@@ -36,14 +36,14 @@ ArabicT5-xLarge --> ArabicT5-17GB-large
|
|
36 |
|
37 |
| Model | <center>TyDi QA| <center>HARD| <center>ArSarcasm-v2-Sentiment| <center>ArSarcasm-v2-Sarcasm| XL-SUM |
|
38 |
|----------------------|---------------|---------------------|-------------------------------------|----------------------------------|----------------------------------
|
39 |
-
| AraT5-base | <center>70.
|
40 |
-
| AraT5-msa-base | <center>70.
|
41 |
-
| AraT5-tweets-base | <center>65.
|
42 |
-
| mT5-base | <center>72.
|
43 |
-
| AraBART-base | <center>48.
|
44 |
-
| ArabicT5-17GB-small | <center>70.
|
45 |
-
| ArabicT5-17GB-base | <center>73.
|
46 |
-
| ArabicT5-17GB-large | <center>**75.
|
47 |
|
48 |
Evaluation Metrics: TyDi QA (EM/F1), HARD (Accuracy), Sentiment Analysis (Accuracy / F1-PN positive-negative), Sarcasm Detection (F1-sarcastic), XL-SUM (Rouge-L with Stemmer).
|
49 |
|
|
|
36 |
|
37 |
| Model | <center>TyDi QA| <center>HARD| <center>ArSarcasm-v2-Sentiment| <center>ArSarcasm-v2-Sarcasm| XL-SUM |
|
38 |
|----------------------|---------------|---------------------|-------------------------------------|----------------------------------|----------------------------------
|
39 |
+
| AraT5-base | <center>70.4/84.2 |<center>**96.5**|<center>69.7/72.6|<center>60.4|<center>30.3|
|
40 |
+
| AraT5-msa-base | <center>70.9/84.0 |<center>**96.5**|<center>70.0/72.7|<center>60.7|<center>27.4|
|
41 |
+
| AraT5-tweets-base | <center>65.1/79.0 |<center>96.3|<center>70.7/73.5|<center>61.1|<center>25.1|
|
42 |
+
| mT5-base | <center>72.2/84.1 |<center>96.2|<center>67.3/68.8|<center>52.2|<center>25.7|
|
43 |
+
| AraBART-base | <center>48.8/71.2 |<center>96.1|<center>66.2/68.2|<center>56.3|<center>31.2|
|
44 |
+
| ArabicT5-17GB-small | <center>70.8/84.8 |<center>96.4|<center>68.9/71.2|<center>58.9|<center>29.2|
|
45 |
+
| ArabicT5-17GB-base | <center>73.3/86.1 |<center>96.4|<center>70.4/73.0|<center>59.8|<center>30.3|
|
46 |
+
| ArabicT5-17GB-large | <center>**75.5/87.1** |<center>**96.5**| <center>**72.2/75.2**|<center>**61.7**|<center>**31.7**|
|
47 |
|
48 |
Evaluation Metrics: TyDi QA (EM/F1), HARD (Accuracy), Sentiment Analysis (Accuracy / F1-PN positive-negative), Sarcasm Detection (F1-sarcastic), XL-SUM (Rouge-L with Stemmer).
|
49 |
|