Independent evaluation results
#139
by
yaronr
- opened
Dear Meta-llama team,
I'm pleased to share our independent evaluation of the model using our implementation of the MMLU-Pro benchmark.
I hope you find this useful.