jpacifico commited on
Commit
1156cbf
1 Parent(s): 3f9d9ea

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -22,8 +22,9 @@ Window context = 4k tokens
22
  ### Benchmarks
23
 
24
  Chocolatine is the best-performing 3B model on the [OpenLLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) (august 2024)
 
25
 
26
- ![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/openllm_choco3b_revised.png?raw=false)
27
 
28
 
29
  | Metric |Value|
@@ -40,7 +41,7 @@ Chocolatine is the best-performing 3B model on the [OpenLLM Leaderboard](https:/
40
  ### MT-Bench-French
41
 
42
  Chocolatine-3B-Instruct-DPO-Revised is outperforming GPT-3.5-Turbo on [MT-Bench-French](https://huggingface.co/datasets/bofenghuang/mt-bench-french), used with [multilingual-mt-bench](https://github.com/Peter-Devine/multilingual_mt_bench) and GPT-4-Turbo as LLM-judge.
43
- Notably, this latest version of the Chocolatine-3B model is approaching the performance of Phi-3-Medium (14B) in French, which is a remarkable achievement.
44
 
45
  ```
46
  ########## First turn ##########
 
22
  ### Benchmarks
23
 
24
  Chocolatine is the best-performing 3B model on the [OpenLLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) (august 2024)
25
+ [Update 2024-08-22] Chocolatine-3B also outperforms Microsoft's new model Phi-3.5-mini-instruct on the average benchmarks of the 3B category.
26
 
27
+ ![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/openllm_chocolatine_3B_22082024.png?raw=false)
28
 
29
 
30
  | Metric |Value|
 
41
  ### MT-Bench-French
42
 
43
  Chocolatine-3B-Instruct-DPO-Revised is outperforming GPT-3.5-Turbo on [MT-Bench-French](https://huggingface.co/datasets/bofenghuang/mt-bench-french), used with [multilingual-mt-bench](https://github.com/Peter-Devine/multilingual_mt_bench) and GPT-4-Turbo as LLM-judge.
44
+ Notably, this latest version of the Chocolatine-3B model is approaching the performance of Phi-3-Medium (14B) in French.
45
 
46
  ```
47
  ########## First turn ##########