Accessing examples used for n-shot evals

#26
by akritivij - opened

For n-shot evaluations, is it possible to access the specific examples used for the evals? e.g. SQuAD2 is run using 4-shot - is it possible to find out what the 4 examples are?

Sign up or log in to comment