Update README.md
README.md
CHANGED
@@ -62,11 +62,13 @@ We achieve the following MT-Bench scores across 6 languages:
| **Russian** 🇷🇺 | NaN | 8.19 | 8.28 | 7.94 |
| **Chinese** 🇨🇳 | NaN | 7.11 | 6.97 | 7.55 |
| **English** 🇺🇸 | 7.98 | 7.73 | 7.92 | 8.26 |
+(Note that the Russian scores exclude code, reasoning, and math problems because no translated reference answers are available for these questions.)

We observe minimal degradation of Llama 3's English ability while achieving best-in-class multilingual abilities compared to the top-rated 7B model ([Nexusflow/Starling-LM-7B-beta](https://huggingface.co/Nexusflow/Starling-LM-7B-beta)) on the [Chatbot Arena Leaderboard](https://chat.lmsys.org/?leaderboard).

[Here is our evaluation script.](https://drive.google.com/file/d/15HPn7452t8LbTD9HKSl7ngYYWnsoOG08/view?usp=sharing)

+
# Training data

We train on three sources of data to create this model: