AIR-Bench / leaderboard / tests

Commit History

test: add unit tests for envs
0af261c

nan committed on

test: add unit tests for columns
729aa2a

nan committed on

refactor: reformat
a3d4c8d

nan committed on

refactor: restructure the files
98e75e7

nan committed on

refactor: use enum class for the task type
6f9f649

nan committed on

refactor: reformat with black
ec8e2d4

nan committed on

feat: implement the version selector for qa
7845083

nan committed on

refactor: refactor the codes
32ee53f

nan committed on

refactor: rename the benchmarks enum
270c122

nan committed on

refactor: refactor the benchmarks
3fcf957

nan committed on

refactor: refactor the column settings
a7c0332

nan committed on

refactor: refactor the benchmarks
649e0fb

nan committed on

refactor: move the data model
4eb64b4

nan committed on

refactor: remove the unnecessary variables
592bb62

nan committed on

refactor: move the column names to a separate file
e2d3123

nan committed on

fix-bug-in-show-details-0517 (#9)
7ca7624
verified

nan committed on

fix: fix the bug in duplicated columns
6d7eea4

nan committed on

refactor: remove the legacy directory
5eb510c

nan committed on

refactor: remove the legacy directory
b9c8c30

nan committed on

refactor: remove the legacy directory
8e1f9af

nan committed on

feat: add revision and timestamp information
df659d0

nan committed on

feat: use iso 8601 for timestamp
5664d71

nan committed on

feat: switch the default metric to ndcg_at_10
b33239d

nan committed on

feat: adapt to the latest data format
1a2dba5

nan committed on

chore: clean up
6c22ed3

nan committed on

chore: clean up the requests related codes
8a1daf9

nan committed on

chore: clean up the requests related codes
3b83af7

nan committed on

chore: clean up requests-related codes
e5c7cad

nan committed on

feat: fix the table updating
f30cbcc

nan committed on

feat: add metric selector
5808d8f

nan committed on

fix: fix the data loader
1e768ec

nan committed on

feat: adapt UI in app.py
e8879cc

nan committed on

feat: adapt the utils in app.py
9c49811

nan committed on

feat: fix the to_dict function
3d59d51

nan committed on

feat: add unittests
ea6034c

nan committed on

feat: separate the qa and longdoc tasks
9134169

nan committed on

feat: adapt the data loading part
8b7a945

nan committed on