Spaces:
Sleeping
Sleeping
Commit History
clarified how to get MMLU average
6095833
Clémentine
commited on
rm lighteval sha from id
2bb5ded
Clémentine
commited on
added precision
6eaad72
Clémentine
commited on
fix rounding
d350941
Clémentine
commited on
corrected display of symbols
35763fc
Clémentine
commited on
Merge branch 'main' into link_requests_and_results
a79408c
Nathan Habib
commited on
get model type info from request file
80f4eeb
Nathan Habib
commited on
added more nuance in ft models
95f85ed
Clémentine
commited on
Update src/auto_leaderboard/model_metadata_type.py (#159)
eedc0f1
corrected finetuned mentions
0f4a719
Clémentine
commited on
Update newcomers (#153)
d7daa68
look for model type in request file
d295afa
Nathan Habib
commited on
removed default value for model type
ef627e9
Clémentine
commited on
Update new model types (#150)
9977ce1
Fix Typo (#144)
6ddee3c
add two more (#142)
5d5681a
More metadata type updates (#141)
3602349
Wording: number of parameters? (#138)
edd3aac
Add type to more models (#140)
6d0d93c
model type filters (#139)
601f2e9
updated design to select columns to display
ecef2dc
Clémentine
commited on
Update app.py
613696b
Short emoji legend on first page (#133)
a66fcca
adding count of models in evaluation queue and finished status (#127)
740b29d
only display the scores for the latest result file
d6b3d82
Nathan Habib
commited on
Update model types (#126)
9d5015b
Added icons for types + fixed pending queue
b323764
Clémentine
commited on
wip adding symbols to model types
217b585
Clémentine
commited on
fix new config name
4aff44e
Nathan Habib
commited on
fix when looking for addapter model in hub
2f6ebf5
Nathan Habib
commited on
FT: precision and adapter models
12cea14
Clémentine
commited on
added selector for model type
99b25b8
Clémentine
commited on
Update src/assets/text_content.py
a0b557b
updated model param number reader
1df8383
Clémentine
commited on
updated version
788108a
Clémentine
commited on
added precision for truthfulqa 6 shot
18916e3
Clémentine
commited on
fix view
00358b1
Clémentine
commited on
moved the submit to a tab since the results are becoming very long
8dfa543
Clémentine
commited on
Small fix - we do not want to display models where the MMLU is old with models where the MMLU is new - however, since version is displayed in the results, we keep the files
97b27da
Clémentine
commited on
Add details on the datasets for reproducibility (#107)
256c5d3
Using the new backend
d16cee2
Linker1907
commited on
small fix link Ilyas leaderboard
e868f35
Clémentine
commited on
added harness command
d2e8eca
Clémentine
commited on
revamp
6e8f400
Clémentine
commited on
column fix
d52179b
Clémentine
commited on
merge refactor
460d762
Clémentine
commited on
Update Vicuna link
a7cba30
sheonhan
commited on
Adjust description for TruthfulQA
5601a63
NimaBoscarino
commited on
Copy change
ce824ba
sheonhan
commited on