research-backup
/

roberta-large-semeval2012-mask-prompt-d-nce-classification

@@ -14,7 +14,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (SAT full)
       type: multiple-choice-qa
@@ -25,7 +25,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (SAT)
       type: multiple-choice-qa
@@ -36,7 +36,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (BATS)
       type: multiple-choice-qa
@@ -47,7 +47,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (Google)
       type: multiple-choice-qa
@@ -58,7 +58,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (U2)
       type: multiple-choice-qa
@@ -69,7 +69,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (U4)
       type: multiple-choice-qa
@@ -80,7 +80,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Lexical Relation Classification (BLESS)
       type: classification
@@ -91,10 +91,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (CogALexV)
       type: classification
@@ -105,10 +105,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (EVALution)
       type: classification
@@ -119,10 +119,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (K&H+N)
       type: classification
@@ -133,10 +133,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (ROOT09)
       type: classification
@@ -147,10 +147,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
 ---
 # relbert/roberta-large-semeval2012-mask-prompt-d-nce-classification
@@ -160,20 +160,20 @@ RelBERT fine-tuned from [roberta-large](https://huggingface.co/roberta-large) on
 Fine-tuning is done via [RelBERT](https://github.com/asahi417/relbert) library (see the repository for more detail).
 It achieves the following results on the relation understanding tasks:
 - Analogy Question ([dataset](https://huggingface.co/datasets/relbert/analogy_questions), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-d-nce-classification/raw/main/analogy.json)):
-    - Accuracy on SAT (full): None
-    - Accuracy on SAT: None
-    - Accuracy on BATS: None
-    - Accuracy on U2: None
-    - Accuracy on U4: None
-    - Accuracy on Google: None
 - Lexical Relation Classification ([dataset](https://huggingface.co/datasets/relbert/lexical_relation_classification), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-d-nce-classification/raw/main/classification.json)):
-    - Micro F1 score on BLESS: None
-    - Micro F1 score on CogALexV: None
-    - Micro F1 score on EVALution: None
-    - Micro F1 score on K&H+N: None
-    - Micro F1 score on ROOT09: None
 - Relation Mapping ([dataset](https://huggingface.co/datasets/relbert/relation_mapping), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-d-nce-classification/raw/main/relation_mapping.json)):
-    - Accuracy on Relation Mapping: None
 ### Usage

     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.796765873015873
   - task:
       name: Analogy Questions (SAT full)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.6524064171122995
   - task:
       name: Analogy Questions (SAT)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.6498516320474778
   - task:
       name: Analogy Questions (BATS)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.7509727626459144
   - task:
       name: Analogy Questions (Google)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.902
   - task:
       name: Analogy Questions (U2)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.6271929824561403
   - task:
       name: Analogy Questions (U4)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.625
   - task:
       name: Lexical Relation Classification (BLESS)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.9246647581738737
     - name: F1 (macro)
       type: f1_macro
+      value: 0.9201116139693363
   - task:
       name: Lexical Relation Classification (CogALexV)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.8826291079812206
     - name: F1 (macro)
       type: f1_macro
+      value: 0.74506786895136
   - task:
       name: Lexical Relation Classification (EVALution)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.7172264355362946
     - name: F1 (macro)
       type: f1_macro
+      value: 0.703292242462215
   - task:
       name: Lexical Relation Classification (K&H+N)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.9616748974055783
     - name: F1 (macro)
       type: f1_macro
+      value: 0.8934154139843127
   - task:
       name: Lexical Relation Classification (ROOT09)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.9094327796928863
     - name: F1 (macro)
       type: f1_macro
+      value: 0.906471425124189
 ---
 # relbert/roberta-large-semeval2012-mask-prompt-d-nce-classification
 Fine-tuning is done via [RelBERT](https://github.com/asahi417/relbert) library (see the repository for more detail).
 It achieves the following results on the relation understanding tasks:
 - Analogy Question ([dataset](https://huggingface.co/datasets/relbert/analogy_questions), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-d-nce-classification/raw/main/analogy.json)):
+    - Accuracy on SAT (full): 0.6524064171122995
+    - Accuracy on SAT: 0.6498516320474778
+    - Accuracy on BATS: 0.7509727626459144
+    - Accuracy on U2: 0.6271929824561403
+    - Accuracy on U4: 0.625
+    - Accuracy on Google: 0.902
 - Lexical Relation Classification ([dataset](https://huggingface.co/datasets/relbert/lexical_relation_classification), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-d-nce-classification/raw/main/classification.json)):
+    - Micro F1 score on BLESS: 0.9246647581738737
+    - Micro F1 score on CogALexV: 0.8826291079812206
+    - Micro F1 score on EVALution: 0.7172264355362946
+    - Micro F1 score on K&H+N: 0.9616748974055783
+    - Micro F1 score on ROOT09: 0.9094327796928863
 - Relation Mapping ([dataset](https://huggingface.co/datasets/relbert/relation_mapping), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-d-nce-classification/raw/main/relation_mapping.json)):
+    - Accuracy on Relation Mapping: 0.796765873015873
 ### Usage