bedio commited on
Commit
48b9b9f
1 Parent(s): 6095219

Adding Evaluation Results (#2)

Browse files

- Adding Evaluation Results (c4b5b52e7d78f190c1d08ca03e8a2af60e4b86da)

Files changed (1) hide show
  1. README.md +14 -0
README.md CHANGED
@@ -301,3 +301,17 @@ Model is tested using lm-harness tool version 0.4.3
301
  ## Model Card Contact
302
 
303
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
301
  ## Model Card Contact
302
 
303
 
304
+
305
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
306
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_DeepAutoAI__Explore_Llama-3.2-1B-Inst)
307
+
308
+ | Metric |Value|
309
+ |-------------------|----:|
310
+ |Avg. |13.58|
311
+ |IFEval (0-Shot) |57.68|
312
+ |BBH (3-Shot) | 8.31|
313
+ |MATH Lvl 5 (4-Shot)| 4.53|
314
+ |GPQA (0-shot) | 1.57|
315
+ |MuSR (0-shot) | 1.09|
316
+ |MMLU-PRO (5-shot) | 8.31|
317
+