MaziyarPanahi
commited on
Commit
•
a619d18
1
Parent(s):
2883125
Update README.md (#3)
Browse files- Update README.md (0508818bd5e3652fedaf2e57519c782c0bc6e75f)
README.md
CHANGED
@@ -156,6 +156,21 @@ So, 25 - 4 * 2 + 3 = 20.</s>
|
|
156 |
|
157 |
## Eval
|
158 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
159 |
source: https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__Bioxtral-4x7B-v0.1
|
160 |
|
161 |
```python
|
|
|
156 |
|
157 |
## Eval
|
158 |
|
159 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fd5e18a90b6dc4633f6d292/PR-Py7u6uhcxKTdCpPY4-.png)
|
160 |
+
|
161 |
+
| Metric | BioMistral-7B | Bioxtral-4x7B-v0.1 |
|
162 |
+
|-----------------------------|---------------|--------------------|
|
163 |
+
| **Average** | 54.99 | **70.84** |
|
164 |
+
| ARC | 54.27 | **68.34** |
|
165 |
+
| HellaSwag | 79.09 | **87.27** |
|
166 |
+
| TruthfulQA | 51.61 | **68.45** |
|
167 |
+
| Winogrande | 73.48 | **82.90** |
|
168 |
+
| GSM8K | 0 | **56.63** |
|
169 |
+
| Professional Medicine | 55.51 | **67.3** |
|
170 |
+
| College Medicine | 58.96 | **61.84** |
|
171 |
+
| Medical Genetics | 67.00 | **74.0** |
|
172 |
+
|
173 |
+
|
174 |
source: https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__Bioxtral-4x7B-v0.1
|
175 |
|
176 |
```python
|