jpacifico/Chocolatine-3B-Instruct-DPO-Revised

jpacifico committed 29 days ago

Commit 1156cbf (1 parent: 3f9d9ea)

Update README.md

Files changed (1)

README.md +3 -2

@@ -22,8 +22,9 @@ Window context = 4k tokens
 ### Benchmarks
 Chocolatine is the best-performing 3B model on the [OpenLLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) (august 2024)
-![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/openllm_choco3b_revised.png?raw=false)
+[Update 2024-08-22] Chocolatine-3B also outperforms Microsoft's new model Phi-3.5-mini-instruct on the average benchmarks of the 3B category.
+![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/openllm_chocolatine_3B_22082024.png?raw=false)
 |      Metric       |Value|
@@ -40,7 +41,7 @@ Chocolatine is the best-performing 3B model on the [OpenLLM Leaderboard](https:/
 ### MT-Bench-French
 Chocolatine-3B-Instruct-DPO-Revised is outperforming GPT-3.5-Turbo on [MT-Bench-French](https://huggingface.co/datasets/bofenghuang/mt-bench-french), used with [multilingual-mt-bench](https://github.com/Peter-Devine/multilingual_mt_bench) and GPT-4-Turbo as LLM-judge.
-Notably, this latest version of the Chocolatine-3B model is approaching the performance of Phi-3-Medium (14B) in French, which is a remarkable achievement.
+Notably, this latest version of the Chocolatine-3B model is approaching the performance of Phi-3-Medium (14B) in French.
 ```
 ########## First turn ##########