Commit
•
d5fb657
1
Parent(s):
e49ab07
Add verifyToken field to verify evaluation results are produced by Hugging Face's automatic model evaluator
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋! We've added a new `verifyToken` field to your evaluation results to verify that they are produced by the model evaluator. Accept this PR to ensure that your results remain listed as **verified** on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards).
README.md
CHANGED
@@ -2,6 +2,13 @@
|
|
2 |
license: mit
|
3 |
tags:
|
4 |
- generated_from_trainer
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
5 |
model-index:
|
6 |
- name: rob-base-superqa
|
7 |
results:
|
@@ -14,14 +21,16 @@ model-index:
|
|
14 |
config: adversarialQA
|
15 |
split: validation
|
16 |
metrics:
|
17 |
-
-
|
18 |
-
type: exact_match
|
19 |
value: 43.8667
|
|
|
20 |
verified: true
|
21 |
-
|
22 |
-
|
23 |
value: 55.135
|
|
|
24 |
verified: true
|
|
|
25 |
- task:
|
26 |
type: question-answering
|
27 |
name: Question Answering
|
@@ -31,14 +40,16 @@ model-index:
|
|
31 |
config: squad_v2
|
32 |
split: validation
|
33 |
metrics:
|
34 |
-
-
|
35 |
-
type: exact_match
|
36 |
value: 79.2432
|
|
|
37 |
verified: true
|
38 |
-
|
39 |
-
|
40 |
value: 82.336
|
|
|
41 |
verified: true
|
|
|
42 |
- task:
|
43 |
type: question-answering
|
44 |
name: Question Answering
|
@@ -48,21 +59,16 @@ model-index:
|
|
48 |
config: default
|
49 |
split: validation
|
50 |
metrics:
|
51 |
-
-
|
52 |
-
type: exact_match
|
53 |
value: 78.8581
|
|
|
54 |
verified: true
|
55 |
-
|
56 |
-
|
57 |
value: 82.8261
|
|
|
58 |
verified: true
|
59 |
-
|
60 |
-
- question-answering
|
61 |
-
datasets:
|
62 |
-
- squad_v2
|
63 |
-
- quoref
|
64 |
-
- adversarial_qa
|
65 |
-
- duorc
|
66 |
---
|
67 |
|
68 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
2 |
license: mit
|
3 |
tags:
|
4 |
- generated_from_trainer
|
5 |
+
datasets:
|
6 |
+
- squad_v2
|
7 |
+
- quoref
|
8 |
+
- adversarial_qa
|
9 |
+
- duorc
|
10 |
+
task:
|
11 |
+
- question-answering
|
12 |
model-index:
|
13 |
- name: rob-base-superqa
|
14 |
results:
|
|
|
21 |
config: adversarialQA
|
22 |
split: validation
|
23 |
metrics:
|
24 |
+
- type: exact_match
|
|
|
25 |
value: 43.8667
|
26 |
+
name: Exact Match
|
27 |
verified: true
|
28 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYzIxMWZiZWM1MTJmMGIxM2I5NTFjNGI5OTJiNDdjODQ3NDNkYjRkYTI3ZmZkNGVmMGYzZDk5MTZhNDE4YzI1YiIsInZlcnNpb24iOjF9.QAj_iwD0yN2woSbGAN9xVRKoDKxldZbleFeJr77P2s7xWQBsKCuY0b5-2WIL79EcTCChvjNITeriPXqz8mGMAw
|
29 |
+
- type: f1
|
30 |
value: 55.135
|
31 |
+
name: F1
|
32 |
verified: true
|
33 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjJkMzZjNTVhZTI5OTVhNTU4NDcyMjM1ZWJiODVjNzBhODRmZjlmMjE0MDUzMmU4NzNlNzA5NjgyODdkNTJmZSIsInZlcnNpb24iOjF9.O0KoLquXYbF3P2PGCFW8bxYEVe_yDW-WzEqpOmbIs_e9v4tcygH19ZUYFjMDFSll91SPJ2oIbVovsUISYuknCg
|
34 |
- task:
|
35 |
type: question-answering
|
36 |
name: Question Answering
|
|
|
40 |
config: squad_v2
|
41 |
split: validation
|
42 |
metrics:
|
43 |
+
- type: exact_match
|
|
|
44 |
value: 79.2432
|
45 |
+
name: Exact Match
|
46 |
verified: true
|
47 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjBjZjhjMTMzMzZhOTg1OGYyZDY2MzZjYmQ4NmNlNWI5MWNmNTBiZjY1Njg0YTYyMmRlNzlkZDU1NTZjOWM5ZCIsInZlcnNpb24iOjF9.1vo9JoASJ_zvOVa4lTRMNPljUvMon-E6QOZ1n_KFQBMtRvRY883ECudhAzb5LGpLntyM2EN5bfyfTQ6dfjjsDg
|
48 |
+
- type: f1
|
49 |
value: 82.336
|
50 |
+
name: F1
|
51 |
verified: true
|
52 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOWFiZGMyMzkwOTlkMWVkZmExNjdkZTM1YjRkYzRkZDlhOGZlMjEwMGNjNjJhYjM5MjZlNDI3ZDEyNmViOGYyOSIsInZlcnNpb24iOjF9.f3xlhop8hXWCCWFXWZgyK9r8Cy5KE3gPgYNV3bRN78teN_hjYH5sDl4wMTMcPU-bsPX70_wvsuvU-r95ByF4Bg
|
53 |
- task:
|
54 |
type: question-answering
|
55 |
name: Question Answering
|
|
|
59 |
config: default
|
60 |
split: validation
|
61 |
metrics:
|
62 |
+
- type: exact_match
|
|
|
63 |
value: 78.8581
|
64 |
+
name: Exact Match
|
65 |
verified: true
|
66 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZjMyYTAxOWJhYTM5YWNmNGFhZDg3NTIwN2UxN2RhYzQxYzFiODJjYTcyZTk5MGMwODNhMzA3Nzc3MDQzYjcwMiIsInZlcnNpb24iOjF9.FSNswUf1Y5ZnlS0fSm-lxsA1klUphzfDhfj00U5benVd0QiYvyeqRclC7Pw8B3RV9Oe1cZzfeDDA5fXY2A5JBw
|
67 |
+
- type: f1
|
68 |
value: 82.8261
|
69 |
+
name: F1
|
70 |
verified: true
|
71 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNWQyMTNhZTc0MTdiMzNiNzc3YzhkNTk5ZWRkMWZlYjc4ZGU3YTFkNDkyZDg0NWFiYzFhMGQyMzZjYjcwNTE1YSIsInZlcnNpb24iOjF9.9waqQm_EBPo41pdOMmoY6r_-K7-3zUxt1AB4ndHTY50S5k5yyub8NdCJz09hBhbRd1_-1t3UT5p8HnFjAjF9DQ
|
|
|
|
|
|
|
|
|
|
|
|
|
72 |
---
|
73 |
|
74 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|