bedio committed on
Commit c4b5b52
1 Parent(s): 2f17c5f

Adding Evaluation Results

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Files changed (1)
  1. README.md +14 -0
README.md CHANGED
@@ -314,3 +314,17 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
  |MuSR (0-shot)      | 1.09|
  |MMLU-PRO (5-shot)  | 8.31|

+
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_DeepAutoAI__Explore_Llama-3.2-1B-Inst)
+
+ |      Metric       |Value|
+ |-------------------|----:|
+ |Avg.               |13.58|
+ |IFEval (0-Shot)    |57.68|
+ |BBH (3-Shot)       | 8.31|
+ |MATH Lvl 5 (4-Shot)| 4.53|
+ |GPQA (0-shot)      | 1.57|
+ |MuSR (0-shot)      | 1.09|
+ |MMLU-PRO (5-shot)  | 8.31|
+
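
As a quick sanity check on the added table, the sketch below recomputes the `Avg.` row from the six benchmark rows. It assumes `Avg.` is the unweighted arithmetic mean of those scores; the PR text does not state how the value is derived. The names `scores` and `mean_score` are illustrative and are not part of the PR or of any leaderboard tooling.

```python
# Benchmark scores copied from the table added in this PR.
scores = {
    "IFEval (0-Shot)": 57.68,
    "BBH (3-Shot)": 8.31,
    "MATH Lvl 5 (4-Shot)": 4.53,
    "GPQA (0-shot)": 1.57,
    "MuSR (0-shot)": 1.09,
    "MMLU-PRO (5-shot)": 8.31,
}


def mean_score(values: dict[str, float]) -> float:
    """Unweighted mean of the scores (assumed definition of `Avg.`)."""
    return sum(values.values()) / len(values)


# Round to two decimals to match the precision used in the table.
average = round(mean_score(scores), 2)
print(f"Recomputed average: {average:.2f}")  # compare with the Avg. row (13.58)
```

Under that assumption the recomputed value agrees with the reported `Avg.` of 13.58, so the table is internally consistent.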