raspberry-3B / README.md
qnguyen3's picture
Update README.md
66bf134 verified
|
raw
history blame
1.27 kB
metadata
library_name: transformers
license: other
license_name: qwen-research
license_link: https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE
base_model: Qwen/Qwen2.5-3B
tags:
  - generated_from_trainer
model-index:
  - name: outputs/gelato-3b
    results: []

Prompt Format: ChatML

This is an experimental which was heavily optimized for reasoning tasks and not meant for production-use.

GGUFs: https://huggingface.co/mradermacher/raspberry-3B-GGUF

image/png

image/png

Open LLM Leaderboard Evaluation Results

Metric Value
Avg. 29.79
IFEval (0-Shot) 32.12
BBH (3-Shot) 42.23
MATH Lvl 5 (4-Shot) 8.16
GPQA (0-shot) 27.10
MuSR (0-shot) 40.61
MMLU-PRO (5-shot) 28.49