lewtun HF staff commited on
Commit
030bb52
1 Parent(s): 17d50a1

Add evaluation results on the mnli config of glue

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mnli config of the [glue](https://huggingface.co/datasets/glue) dataset by

@lewtun

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-glue-mnli-026a6e-14686015).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=glue).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=glue).

Files changed (1) hide show
  1. README.md +53 -0
README.md CHANGED
@@ -22,6 +22,59 @@ model-index:
22
  - name: Accuracy
23
  type: accuracy
24
  value: 0.8230268510984541
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
25
  ---
26
 
27
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
22
  - name: Accuracy
23
  type: accuracy
24
  value: 0.8230268510984541
25
+ - task:
26
+ type: natural-language-inference
27
+ name: Natural Language Inference
28
+ dataset:
29
+ name: glue
30
+ type: glue
31
+ config: mnli
32
+ split: validation_matched
33
+ metrics:
34
+ - name: Accuracy
35
+ type: accuracy
36
+ value: 0.8189505858380031
37
+ verified: true
38
+ - name: Precision Macro
39
+ type: precision
40
+ value: 0.8179669104455792
41
+ verified: true
42
+ - name: Precision Micro
43
+ type: precision
44
+ value: 0.8189505858380031
45
+ verified: true
46
+ - name: Precision Weighted
47
+ type: precision
48
+ value: 0.8185679295201952
49
+ verified: true
50
+ - name: Recall Macro
51
+ type: recall
52
+ value: 0.8175820569584179
53
+ verified: true
54
+ - name: Recall Micro
55
+ type: recall
56
+ value: 0.8189505858380031
57
+ verified: true
58
+ - name: Recall Weighted
59
+ type: recall
60
+ value: 0.8189505858380031
61
+ verified: true
62
+ - name: F1 Macro
63
+ type: f1
64
+ value: 0.8176177699916428
65
+ verified: true
66
+ - name: F1 Micro
67
+ type: f1
68
+ value: 0.8189505858380031
69
+ verified: true
70
+ - name: F1 Weighted
71
+ type: f1
72
+ value: 0.8186059524762352
73
+ verified: true
74
+ - name: loss
75
+ type: loss
76
+ value: 0.46445730328559875
77
+ verified: true
78
  ---
79
 
80
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You