Update README.md
Browse files
README.md
CHANGED
@@ -35,4 +35,14 @@ Benchmark Scores
|
|
35 |
|
36 |
| Tasks |Version|Filter|n-shot|Metric|Value | |Stderr|
|
37 |
|----------|------:|------|-----:|------|-----:|---|-----:|
|
38 |
-
|winogrande| 1|none | 0|acc |0.7774|± |0.0117|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
35 |
|
36 |
| Tasks |Version|Filter|n-shot|Metric|Value | |Stderr|
|
37 |
|----------|------:|------|-----:|------|-----:|---|-----:|
|
38 |
+
|winogrande| 1|none | 0|acc |0.7774|± |0.0117|
|
39 |
+
|
40 |
+
|Tasks|Version| Filter |n-shot| Metric |Value | |Stderr|
|
41 |
+
|-----|------:|----------|-----:|-----------|-----:|---|-----:|
|
42 |
+
|gsm8k| 2|get-answer| 5|exact_match|0.6732|± |0.0129|
|
43 |
+
|
44 |
+
| Tasks |Version|Filter|n-shot|Metric|Value | |Stderr|
|
45 |
+
|--------------|------:|------|-----:|------|-----:|---|-----:|
|
46 |
+
|truthfulqa_mc2| 2|none | 0|acc |0.4795|± |0.0148|
|
47 |
+
|
48 |
+
Average 65.658
|