title: README
emoji: π
colorFrom: blue
colorTo: red
sdk: static
pinned: false
We present Auto-Arena of LLMs, which automates the entire LLM evaluation process with LLM-based agents to provide reliable, human-like evaluations.
Feel free to check out:
* [project page](https://auto-arena.github.io/)
* [paper](https://arxiv.org/abs/2405.20267)
* [Auto-Arena Leaderboard](https://huggingface.co/spaces/Auto-Arena/Leaderboard)