Independent evaluation results

#139
by yaronr - opened

Dear Meta-llama team,

I'm pleased to share our independent evaluation of the model using our implementation of the MMLU-Pro benchmark.

I hope you find this useful.

Sign up or log in to comment