jpacifico commited on
Commit
fa3e742
1 Parent(s): 558125f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -0
README.md CHANGED
@@ -26,6 +26,18 @@ Chocolatine is the **best-performing 3B model** on the [OpenLLM Leaderboard](htt
26
 
27
  ![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/openllm_choco3b_revised.png?raw=false)
28
 
 
 
 
 
 
 
 
 
 
 
 
 
29
  ### MT-Bench-French
30
 
31
  Chocolatine-3B-Instruct-DPO-Revised is outperforming GPT-3.5-Turbo on [MT-Bench-French](https://huggingface.co/datasets/bofenghuang/mt-bench-french) by Bofeng Huang,
 
26
 
27
  ![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/openllm_choco3b_revised.png?raw=false)
28
 
29
+
30
+ | Metric |Value|
31
+ |-------------------|----:|
32
+ |Avg. |27.63|
33
+ |IFEval (0-Shot) |56.23|
34
+ |BBH (3-Shot) |37.16|
35
+ |MATH Lvl 5 (4-Shot)|14.5|
36
+ |GPQA (0-shot) |9.62|
37
+ |MuSR (0-shot) |15.1|
38
+ |MMLU-PRO (5-shot) |33.21|
39
+
40
+
41
  ### MT-Bench-French
42
 
43
  Chocolatine-3B-Instruct-DPO-Revised is outperforming GPT-3.5-Turbo on [MT-Bench-French](https://huggingface.co/datasets/bofenghuang/mt-bench-french) by Bofeng Huang,