Official results for common benchmarks like MMLU, GSM8K and others
#23
by
JosephusCheung
- opened
This will help inform the overall performance of the model.
Thanks for this comment. We have run the model on the open leaderboard - you can find the results here: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
sarahooker
changed discussion status to
closed