Remek commited on
Commit
2a5de68
1 Parent(s): 1dd6e6b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -175,7 +175,7 @@ The results from the Open PL LLM Leaderboard demonstrate the exceptional perform
175
 
176
  1. Superior performance in its class: Bielik-11B-v2.3-Instruct outperforms all other models with less than 70B parameters. This is a significant achievement, showcasing its efficiency and effectiveness despite having fewer parameters than many competitors.
177
 
178
- 2. Competitive with larger models: with a score of ~~65.45~~, Bielik-11B-v2.3-Instruct performs on par with models in the 70B parameter range. This indicates that it achieves comparable results to much larger models, demonstrating its advanced architecture and training methodology.
179
 
180
  3. Substantial improvement over previous version: the model shows a marked improvement over its predecessor, Bielik-7B-Instruct-v0.1, which scored 43.64. This leap in performance highlights the successful enhancements and optimizations implemented in this newer version.
181
 
@@ -195,7 +195,7 @@ This section presents a focused comparison of generative Polish language task pe
195
  | Bielik-11B-v2.0-Instruct | 11 | 65.58 |
196
  | gpt-3.5-turbo-instruct | Unknown | 55.65 |
197
 
198
- The performance variation among Bielik versions is minimal, indicating consistent quality across iterations. Bielik-11B-v2.3-Instruct demonstrates an impressive ~~19.6%~~ performance advantage over GPT-3.5.
199
 
200
 
201
  ### Open LLM Leaderboard
 
175
 
176
  1. Superior performance in its class: Bielik-11B-v2.3-Instruct outperforms all other models with less than 70B parameters. This is a significant achievement, showcasing its efficiency and effectiveness despite having fewer parameters than many competitors.
177
 
178
+ 2. Competitive with larger models: with a score of 65.71, Bielik-11B-v2.3-Instruct performs on par with models in the 70B parameter range. This indicates that it achieves comparable results to much larger models, demonstrating its advanced architecture and training methodology.
179
 
180
  3. Substantial improvement over previous version: the model shows a marked improvement over its predecessor, Bielik-7B-Instruct-v0.1, which scored 43.64. This leap in performance highlights the successful enhancements and optimizations implemented in this newer version.
181
 
 
195
  | Bielik-11B-v2.0-Instruct | 11 | 65.58 |
196
  | gpt-3.5-turbo-instruct | Unknown | 55.65 |
197
 
198
+ The performance variation among Bielik versions is minimal, indicating consistent quality across iterations. Bielik-11B-v2.3-Instruct demonstrates an impressive 21.2% performance advantage over GPT-3.5.
199
 
200
 
201
  ### Open LLM Leaderboard