mascIT
/

bertina-3M

@@ -41,6 +41,32 @@ Use it as a foundational model to be finetuned for specific italian tasks.
 - optim: AdamW (beta_1=0.8)
 - weight_decay: 1e-2
-# Eval
-- perplexity: 19 (it's a 12MB model!)

 - optim: AdamW (beta_1=0.8)
 - weight_decay: 1e-2
+- Dev set perplexity: 19 (it's a 12MB model!)
+# Evaluation (UINAUIL)
+Following the [UINAUIL setup](https://github.com/valeriobasile/uinauil/tree/main) we can summarise the following results on BERTINA-3M:
+**CLASSIFICATION TASKS**
+```
+task,type,p,r,f1,acc
+haspeede,classification,0.699,0.687,0.680,0.685
+ironita,classification,0.701,0.701,0.701,0.701
+sentipolc,classification,0.649,0.588,0.587,0.560
+```
+**ENTAILMENT TASKS**
+```
+task,type,p,r,f1,acc
+textualentailment,entailment,0.423,0.530,0.401,0.530
+```
+**SEQUENCE TASKS**
+```
+task,type,acc
+eventi,NER,0.835
+facta,NER,0.967
+```