Open source benchmarks?

by jweissenberger - opened Jul 5, 2023

Jul 5, 2023

•

edited Jul 5, 2023

Has anyone measured this model on opensource LLM benchmarks? The 7 billion mpt model is on the HF LLM leader board here and there are some metrics are found in this blog but I'd like to see the performance on things like Hellaswag, winogrande, PIQA, MMLU or similar benchmarks if they're available.

Edit: I found MMLU score here at 47.8

jfrankle

Jul 6, 2023

You can see all of the benchmarks you mentioned in the blog here.

jfrankle changed discussion status to closed Jul 6, 2023

jweissenberger

Jul 7, 2023

@jfrankle Those metrics are for your base MPT model not the instruction tuned version.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment