Mistral 7B Arc Easy Contamination based on "Proving Test Set Contamination in Black Box Language Models"
#14 opened by AmeyaPrabhu
- contamination_report.csv +2 -0
contamination_report.csv
CHANGED
@@ -597,3 +597,5 @@ ibragim-bad/arc_challenge;;FLAN;model;;15.6;;data-based;https://arxiv.org/abs/21
 facebook/anli;dev_r3;FLAN;model;;40.2;;data-based;https://arxiv.org/abs/2109.01652;13
 facebook/anli;dev_r2;FLAN;model;;97.9;;data-based;https://arxiv.org/abs/2109.01652;13
 facebook/anli;dev_r1;FLAN;model;;98.6;;data-based;https://arxiv.org/abs/2109.01652;13
+
+ibragim-bad/arc_easy;;mistralai/Mistral-7B-v0.1;model;;;100.0;model-based;https://arxiv.org/abs/2310.17623;14
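A minimal sketch for sanity-checking the added row, assuming only what the diff shows: the report is semicolon-delimited and may contain blank lines. The helper name `parse_rows` is hypothetical and not part of the repository, and the fields are addressed by position because the column names do not appear in this diff.

```python
import csv
import io

# Two rows copied verbatim from the diff: one existing context row and the
# row added by this PR, separated by the blank line the PR also adds.
ROWS = """\
facebook/anli;dev_r3;FLAN;model;;40.2;;data-based;https://arxiv.org/abs/2109.01652;13

ibragim-bad/arc_easy;;mistralai/Mistral-7B-v0.1;model;;;100.0;model-based;https://arxiv.org/abs/2310.17623;14
"""


def parse_rows(text):
    """Split semicolon-delimited report rows into field lists, skipping blank lines."""
    reader = csv.reader(io.StringIO(text), delimiter=";")
    # csv.reader yields an empty list for a blank line, so filter those out.
    return [row for row in reader if row]


rows = parse_rows(ROWS)

# Every row should have the same number of fields as its neighbours.
field_counts = {len(row) for row in rows}
print(field_counts)
```

If the set printed at the end holds more than one value, a row has a missing or extra `;` and would be misaligned against the rest of the file.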