Is `HuggingFaceH4/human_eval_llm_leaderboard` abandoned?

#355

by zhiminy - opened Nov 8, 2023

zhiminy

Nov 8, 2023

edited Nov 8, 2023

Is https://huggingface.co/spaces/HuggingFaceH4/human_eval_llm_leaderboard abandoned? It has been malfunctioning for a while. I also left a message for the maintainers, but nobody has replied.

clefourrier

Open LLM Leaderboard org Nov 8, 2023

I know that there have been some changes in the team working on this leaderboard, so I'm unsure.
@lewtun @edbeeching might know?

supercharge19

Nov 8, 2023

New models are not showing up on the leaderboard. The ones I would like to see are Zephyr beta, airoboros (mistral2.2, something like that), dolphin2.2, and hermes (mistral).

clefourrier

Open LLM Leaderboard org Nov 8, 2023

To avoid confusion for other users, and since this issue is unrelated to the Open LLM Leaderboard, I'll close it.

I trust Lewis or Ed will come by and answer it once they are on.

clefourrier changed discussion status to closed Nov 8, 2023

lewtun

Open LLM Leaderboard org Nov 9, 2023

Hello @zhiminy @supercharge19! For human evaluation, we recommend checking out LMSYS's Chatbot Arena (https://chat.lmsys.org), which is actively maintained and the community standard in this domain. Our human eval leaderboard is now archived, but kept public for reference :)
