Update README.md
#18 opened by Q-bert
README.md CHANGED
@@ -16,7 +16,7 @@ Independent Benchmark Results:
 - MATH: 100% (0-shot Reflection)
 - GSM8K: 100% (0-shot Reflection)
 - IFEval: 100% (0-shot Reflection)
-- TruthfulQA:
+- TruthfulQA: 100% (0-shot Reflection)
 
 Independent Contamination Results:
 - GPQA: 0%