Spaces:

open-llm-leaderboard
/

open_llm_leaderboard

Running on CPU Upgrade

App Files Files Community

1020

Does Yi models have eval issues?

#393

by migtissera - opened Nov 21, 2023

Discussion

migtissera

Nov 21, 2023

Hey! My Tess-Medium-200K-v1.0 model (renamed to Tess-M-Creative-v1.0) have been running for 2 days now. Is there an error? It hasn't failed, it's still in the "Running Evaluation" queue.

This is the model: https://huggingface.co/migtissera/Tess-M-Creative-v1.0

clefourrier

Open LLM Leaderboard org Nov 21, 2023

•

edited Nov 21, 2023

Hi!
Could you please link to the request file? There are many reasons why this could have happened :)

ndurkee

Nov 21, 2023

Here is one of them.

https://huggingface.co/datasets/open-llm-leaderboard/requests/blob/main/migtissera/Tess-Medium-200K-v1.0_eval_request_False_float16_Original.json

Status lists as failed. Same with deepseek instruct.

clefourrier

Open LLM Leaderboard org Nov 21, 2023

The specific model you linked was cancelled (because a more important job was launched on the cluster) and automatically requeued - we're apparently still having a small issue with our display.

SaylorTwift

Open LLM Leaderboard org Nov 22, 2023

Hi ! Your model actually failed because of network error on our side, I will re-add it to the queue, thanks for your patience :)

migtissera

Nov 22, 2023

Thank you @SaylorTwift

migtissera

Nov 22, 2023

These three models have failed as well, and no idea why

https://huggingface.co/migtissera/Tess-XL-v1.0
https://huggingface.co/migtissera/SynthIA-7B-v2.0
https://huggingface.co/migtissera/SynthIA-70B-v1.5
Could you please take a look @SaylorTwift

clefourrier

Open LLM Leaderboard org Nov 22, 2023

•

edited Nov 22, 2023

Hi @migtissera ,
You'll find the request files for your models here. If you point to them next time, we'll be able to debug your problems faster, as they contain the job id which allows us to look at the logs :)
The first model failed because the launching system had a problem launching this big a model (we will need to launch it on multiple nodes).
The other two were cancelled for priority reasons, I'll let @SaylorTwift tell you if they were rescheduled (and relaunch them if not) - the cluster is very full at the moment though, so it might take some time before they are evaluated.

migtissera

Nov 22, 2023

Thanks @clefourrier ! I didn't know where to find this.. Will do this from next one onwards.. :)

clefourrier

Open LLM Leaderboard org Nov 24, 2023

I'm going to close this issue for now, feel free to reopen if needed

clefourrier changed discussion status to closed Nov 24, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment