title: European Leaderboard
emoji: π
colorFrom: blue
colorTo: blue
sdk: gradio
sdk_version: 4.19.2
app_file: app.py
pinned: false
license: unknown
This is the source code repository of the OpenGPT-X multilingual leaderboard.
The leaderboard aims to provide an overview of LLM performance across various languages.
The basic task set consists of MMLU, ARC, HellaSwag, GSM8k, TruthfulQA and belebele.
To keep the results comparable to the Open LLM Leaderboard (https://huggingface.co/open-llm-leaderboard), we selected the first five tasks and evaluate them on our internal machine translations of the English base tasks. The sixth task, belebele, is a high-quality multilingual benchmark by Meta.
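
As an illustration of how the six tasks could be combined into one score per language, here is a minimal sketch. It is not taken from this repository: the task identifiers, the nested score layout, and the function `average_by_language` are hypothetical, and the unweighted mean is an assumption rather than the leaderboard's confirmed aggregation.

```python
from statistics import mean

# Hypothetical task identifiers, named after the benchmarks listed above.
TASKS = ["mmlu", "arc", "hellaswag", "gsm8k", "truthfulqa", "belebele"]


def average_by_language(scores):
    """Return the unweighted mean over TASKS for each language.

    `scores` is assumed to look like {language: {task: accuracy}}.
    Languages that lack a result for any task are left out, so that
    averages are only compared over the same task set.
    """
    averages = {}
    for language, per_task in scores.items():
        if all(task in per_task for task in TASKS):
            averages[language] = mean(per_task[task] for task in TASKS)
    return averages
```

Calling `average_by_language` with the per-language results of one model yields one number per language, which can then be shown as a leaderboard column. Skipping incomplete languages is a design choice of this sketch; another option would be to report them with an explicit marker.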