AlanRobotics committed on
Commit
59b666e
1 Parent(s): 66fee5d

Update README.md

  1. README.md +19 -9
README.md CHANGED
@@ -131,12 +131,22 @@ The model was trained in two stages. In the first stage, MLP layers were trained

  ### ru-llm-arena: **31.2** (local measurement)

- | **Model** | **Score** | **95% CI** | **Avg Tokens** |
- | ------------------------------------------- | --------- | --------------- | -------------- |
- | **MTSAIR/Cotype-Nano** | **31.2** | **+1.7 / -1.9** | **567** |
- | storm-7b | 20.62 | +2.0 / -1.6 | 419.32 |
- | neural-chat-7b-v3-3 | 19.04 | +2.0 / -1.7 | 927.21 |
- | Vikhrmodels-Vikhr-Llama-3.2-1B-instruct | 19.04 | +1.3 / -1.6 | 958.63 |
- | gigachat_lite | 17.2 | +1.4 / -1.4 | 276.81 |
- | Vikhrmodels-vikhr-qwen-1.5b-it | 13.19 | +1.4 / -1.6 | 2495.38 |
- | meta-llama-Llama-3.2-1B-Instruct | 4.04 | +0.8 / -0.6 | 1240.53 |

+ | Model | Score | 95% CI | Avg. #Tokens |
+ |---------------------------------------------|-------|-------------------------|---------------|
+ | **Cotype-Nano** | **31.2** | **+1.7 / -1.9** | **567** |
+ | hermes-2-pro-llama-3-8b | 30.8 | +2.0 / -2.2 | 463.45 |
+ | openchat-3.6-8b-20240522 | 30.3 | +2.2 / -1.6 | 428.7 |
+ | vikhr-it-5.3-fp16-32k | 27.8 | +1.5 / -2.1 | 519.71 |
+ | vikhr-it-5.3-fp16 | 22.73 | +1.8 / -1.7 | 523.45 |
+ | kolibri-vikhr-mistral-0427 | 22.41 | +1.6 / -1.9 | 489.89 |
+ | snorkel-mistral-pairrm-dpo | 22.41 | +1.7 / -1.6 | 773.8 |
+ | **Cotype-Nano-4bit** | **21.6** | **+2.1 / -1.8** | **587** |
+ | storm-7b | 20.62 | +1.4 / -1.6 | 419.32 |
+ | neural-chat-7b-v3-3 | 19.04 | +1.8 / -1.5 | 927.21 |
+ | Vikhrmodels-Vikhr-Llama-3.2-1B-instruct | 19.04 | +1.2 / -1.5 | 958.63 |
+ | gigachat_lite | 17.2 | +1.5 / -1.5 | 276.81 |
+ | Vikhrmodels-Vikhr-Qwen-2.5-0.5b-Instruct | 16.5 | +1.5 / -1.7 | 583.5 |
+ | Qwen-Qwen2.5-1.5B-Instruct | 16.46 | +1.3 / -1.3 | 483.67 |
+ | Vikhrmodels-vikhr-qwen-1.5b-it | 13.19 | +1.3 / -1.1 | 2495.38 |
+ | meta-llama-Llama-3.2-1B-Instruct | 4.04 | +0.6 / -0.8 | 1240.53 |
+ | Qwen-Qwen2.5-0.5B-Instruct | 4.02 | +0.7 / -0.8 | 829.87 |
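
For readers comparing rows, the `95% CI` column can be turned into absolute bounds. The sketch below assumes each entry is an upward and a downward offset from the score (so `31.2` with `+1.7 / -1.9` spans roughly 29.3 to 32.9). The README does not state this interpretation, and the helper names `ci_bounds` and `intervals_overlap` are hypothetical. Overlap is only a rough reading aid here, not a significance test.

```python
import re


def ci_bounds(score: float, ci: str) -> tuple[float, float]:
    """Convert a '+up / -down' offset string into absolute (lower, upper) bounds.

    Assumption: the first number is added to the score and the second is
    subtracted from it. The minus sign on the second number is optional.
    """
    match = re.fullmatch(r"\s*\+?([\d.]+)\s*/\s*-?([\d.]+)\s*", ci)
    if match is None:
        raise ValueError(f"unrecognised CI format: {ci!r}")
    up, down = float(match.group(1)), float(match.group(2))
    # Round to two decimals to hide floating-point noise in the sums.
    return round(score - down, 2), round(score + up, 2)


def intervals_overlap(a: tuple[float, float], b: tuple[float, float]) -> bool:
    """Return True if two closed intervals share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


# A few rows copied from the updated table: (score, CI offsets).
rows = {
    "Cotype-Nano": (31.2, "+1.7 / -1.9"),
    "openchat-3.6-8b-20240522": (30.3, "+2.2 / -1.6"),
    "Cotype-Nano-4bit": (21.6, "+2.1 / -1.8"),
}

bounds = {name: ci_bounds(score, ci) for name, (score, ci) in rows.items()}

for name, (low, high) in bounds.items():
    print(f"{name}: [{low}, {high}]")
```

Under that reading, the Cotype-Nano interval overlaps the one for openchat-3.6-8b-20240522 but not the one for Cotype-Nano-4bit, which can be checked with `intervals_overlap(bounds["Cotype-Nano"], bounds["Cotype-Nano-4bit"])`.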