Performance LLMs - Fine tuned
Collection
27 items
•
Updated
•
3
Prompt Example:
### System:
You are an AI assistant. User will give you a task. Your goal is to complete the task as faithfully as you can. While performing the task think step-by-step and justify your steps.
### User:
How do you fine tune a large language model?
### Assistant:
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 72.16 |
AI2 Reasoning Challenge (25-Shot) | 63.05 |
HellaSwag (10-Shot) | 84.67 |
MMLU (5-Shot) | 73.95 |
TruthfulQA (0-shot) | 58.11 |
Winogrande (5-shot) | 80.82 |
GSM8k (5-shot) | 72.33 |