Spaces:
Restarting
Restarting
Commit History
adding count of models in evaluation queue and finished status (#127)
740b29d
only display the scores for the latest result file
d6b3d82
Nathan Habib
commited on
Update model types (#126)
9d5015b
Added icons for types + fixed pending queue
b323764
Clémentine
commited on
wip adding symbols to model types
217b585
Clémentine
commited on
fix new config name
4aff44e
Nathan Habib
commited on
fix when looking for addapter model in hub
2f6ebf5
Nathan Habib
commited on
FT: precision and adapter models
12cea14
Clémentine
commited on
added selector for model type
99b25b8
Clémentine
commited on
Update src/assets/text_content.py
a0b557b
updated model param number reader
1df8383
Clémentine
commited on
updated version
788108a
Clémentine
commited on
added precision for truthfulqa 6 shot
18916e3
Clémentine
commited on
fix view
00358b1
Clémentine
commited on
moved the submit to a tab since the results are becoming very long
8dfa543
Clémentine
commited on
Small fix - we do not want to display models where the MMLU is old with models where the MMLU is new - however, since version is displayed in the results, we keep the files
97b27da
Clémentine
commited on
Add details on the datasets for reproducibility (#107)
256c5d3
Using the new backend
d16cee2
Linker1907
commited on
small fix link Ilyas leaderboard
e868f35
Clémentine
commited on
added harness command
d2e8eca
Clémentine
commited on
revamp
6e8f400
Clémentine
commited on
column fix
d52179b
Clémentine
commited on
merge refactor
460d762
Clémentine
commited on
Update Vicuna link
a7cba30
sheonhan
commited on
Adjust description for TruthfulQA
5601a63
NimaBoscarino
commited on
Copy change
ce824ba
sheonhan
commited on
Fix elo ratings model links
e05ec6c
sheonhan
commited on
Add custom url for second tab
7644705
sheonhan
commited on
Still return tab without query params
6a6e05c
sheonhan
commited on
Link to discussion with custom url
8cb7546
sheonhan
commited on
Update tab button
b5f5045
sheonhan
commited on
Update app.py
7a429ab
natolambert
commited on
Update deps
39cc014
sheonhan
commited on
Add GPT-4 & human eval tab
0227006
sheonhan
commited on
Upload scale-hf-logo.png
9cea2a5
sheonhan
commited on
Delete scale-hf-logo.png
74ff6f5
sheonhan
commited on
Upload scale-hf-logo.png
adaa4ee
sheonhan
commited on
adding citations
e61a555
Update CHANGELOG
b3f0642
sheonhan
commited on
Add search emoji
92ae76d
sheonhan
commited on
Search on ENTER
48c5442
sheonhan
commited on
Increase concurrency count
f458f0b
sheonhan
commited on
import datetime correctly
d35aee2
sheonhan
commited on
Fix bullet point about evaluation
3b93b88
sheonhan
commited on
Update CHANGELOG
b29b985
sheonhan
commited on
record submitted time
8696209
sheonhan
commited on
style clean up
aa7c3f4
sheonhan
commited on
implements search bar
ffefe11
sheonhan
commited on
format utils.py
2102b66
sheonhan
commited on