jpacifico committed
Commit b2d8214
1 parent: 30677e5

Update README.md

Files changed (1): README.md (+15 -3)
README.md CHANGED
@@ -16,13 +16,25 @@ pipeline_tag: text-generation
 
 DPO fine-tuned of [microsoft/Phi-3-medium-4k-instruct](https://huggingface.co/microsoft/Phi-3-medium-4k-instruct) (14B params)
 using the [jpacifico/french-orca-dpo-pairs-revised](https://huggingface.co/datasets/jpacifico/french-orca-dpo-pairs-revised) RLHF dataset.
-Training in French also improves the model in English, surpassing the performance of its base model.
+Training in French also improves the model in English, surpassing the performance of its base model (MMLU).
 Window context = 4k tokens
 
 ### Benchmarks
 
-Submitted on the [OpenLLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) (Aug 2024)
-Results coming soon.
+Chocolatine-14B is the best-performing model under 50B parameters on MMLU-PRO on the [OpenLLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) (August 2024)
+
+![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/benchmark_14B_V1.png?raw=false)
+
+| Metric                | Value |
+|-----------------------|------:|
+| Avg.                  | 29.83 |
+| IFEval (0-shot)       | 46.89 |
+| BBH (3-shot)          | 48.02 |
+| MATH Lvl 5 (4-shot)   | 14.88 |
+| GPQA (0-shot)         | 12.19 |
+| MuSR (0-shot)         | 15.15 |
+| **MMLU-PRO (5-shot)** | **41.82** |
 
 ### MT-Bench-French
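The "Avg." row in the added table appears to be the plain arithmetic mean of the six benchmark scores, which is how the Open LLM Leaderboard aggregates its (normalized) metrics. A minimal sketch checking that, using the values from the diff above:

```python
# Benchmark scores copied from the README table added in this commit
# (Open LLM Leaderboard submission, August 2024).
scores = {
    "IFEval (0-shot)": 46.89,
    "BBH (3-shot)": 48.02,
    "MATH Lvl 5 (4-shot)": 14.88,
    "GPQA (0-shot)": 12.19,
    "MuSR (0-shot)": 15.15,
    "MMLU-PRO (5-shot)": 41.82,
}

# Assumption: "Avg." is the unweighted mean of the six scores.
avg = sum(scores.values()) / len(scores)
print(f"Avg. = {avg:.2f}")
```

The computed mean comes out at roughly 29.83, matching the reported "Avg." row up to rounding.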