base_model: meta-llama/Meta-Llama-3-8B
gate_mode: hidden
dtype: bfloat16
experts:
| Metric | Value |
|---|---|
| Avg. | 65.13 |
| AI2 Reasoning Challenge (25-Shot) | 62.80 |
| HellaSwag (10-Shot) | 83.60 |
| MMLU (5-Shot) | 65.13 |
| TruthfulQA (0-shot) | 50.41 |
| Winogrande (5-shot) | 77.27 |
| GSM8k (5-shot) | 58.68 |