leaderboard-pt-pr-bot committed on
Commit
892079d
1 Parent(s): 73ed09c

Fix errors in the leaderboard evaluation results in the ModelCard YAML

The names of a few benchmarks are incorrect in the model metadata. This commit fixes some minor errors introduced by the [last PR](1) in the ModelCard YAML metadata.

Files changed (1)
  1. README.md +27 -1
README.md CHANGED
@@ -71,6 +71,19 @@ model-index:
     - type: f1_macro
       value: 43.01
       name: f1-macro
+    source:
+      url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=wandgibaut/periquito-3B
+      name: Open Portuguese LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: Assin2 STS
+      type: eduagarcia/portuguese_benchmark
+      split: test
+      args:
+        num_few_shot: 15
+    metrics:
     - type: pearson
       value: 8.92
       name: pearson
@@ -98,7 +111,7 @@ model-index:
       name: Text Generation
     dataset:
       name: HateBR Binary
-      type: eduagarcia/portuguese_benchmark
+      type: ruanchaves/hatebr
       split: test
       args:
         num_few_shot: 25
@@ -106,6 +119,19 @@ model-index:
     - type: f1_macro
       value: 50.46
       name: f1-macro
+    source:
+      url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=wandgibaut/periquito-3B
+      name: Open Portuguese LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: PT Hate Speech Binary
+      type: hate_speech_portuguese
+      split: test
+      args:
+        num_few_shot: 25
+    metrics:
     - type: f1_macro
       value: 41.19
       name: f1-macro
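
In the last hunk, the old HateBR Binary entry carried two `f1_macro` values (50.46 and 41.19) until the commit moved the second one into its own PT Hate Speech Binary entry. A reviewer could catch this kind of merged entry with a small check like the sketch below. It assumes the `model-index` results have already been parsed into plain Python dictionaries; the function name `repeated_metric_types` and the sample data layout are hypothetical and are not part of the leaderboard bot or of any Hugging Face API.

```python
from collections import Counter


def repeated_metric_types(results):
    """Return dataset names whose entry lists the same metric type more than once.

    Hypothetical helper: a repeated metric type is only a heuristic hint that
    two benchmarks were merged into one result entry.
    """
    flagged = []
    for entry in results:
        counts = Counter(m["type"] for m in entry.get("metrics", []))
        if any(n > 1 for n in counts.values()):
            flagged.append(entry["dataset"]["name"])
    return flagged


# Entry as it appears on the old side of the diff (values copied from the hunks).
before = [
    {
        "dataset": {"name": "HateBR Binary", "type": "eduagarcia/portuguese_benchmark"},
        "metrics": [
            {"type": "f1_macro", "value": 50.46},
            {"type": "f1_macro", "value": 41.19},
        ],
    },
]

# Entries as they appear on the new side of the diff.
after = [
    {
        "dataset": {"name": "HateBR Binary", "type": "ruanchaves/hatebr"},
        "metrics": [{"type": "f1_macro", "value": 50.46}],
    },
    {
        "dataset": {"name": "PT Hate Speech Binary", "type": "hate_speech_portuguese"},
        "metrics": [{"type": "f1_macro", "value": 41.19}],
    },
]

print(repeated_metric_types(before))  # ['HateBR Binary']
print(repeated_metric_types(after))   # []
```

This heuristic would not flag the first hunk, where the merged metrics had different types (`f1_macro` and `pearson`), so it complements a manual review and does not replace one.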