Spaces:
Sleeping
Sleeping
Link Space to the paper
Browse filesHi,
Niels here from the open-source team at HF. I discovered your work at https://huggingface.co/papers/2408.03281, would be great to link it to the paper.
Cheers,
Niels
README.md
CHANGED
@@ -41,4 +41,18 @@ If you encounter problem on the space, don't hesitate to restart it to remove th
|
|
41 |
You'll find
|
42 |
- the main table' columns names and properties in `src/display/utils.py`
|
43 |
- the logic to read all results and request files, then convert them in dataframe lines, in `src/leaderboard/read_evals.py`, and `src/populate.py`
|
44 |
-
- teh logic to allow or filter submissions in `src/submission/submit.py` and `src/submission/check_validity.py`
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
41 |
You'll find
|
42 |
- the main table' columns names and properties in `src/display/utils.py`
|
43 |
- the logic to read all results and request files, then convert them in dataframe lines, in `src/leaderboard/read_evals.py`, and `src/populate.py`
|
44 |
+
- teh logic to allow or filter submissions in `src/submission/submit.py` and `src/submission/check_validity.py`
|
45 |
+
|
46 |
+
# Citation
|
47 |
+
|
48 |
+
```bibtex
|
49 |
+
@misc{cao2024structevaldeepenbroadenlarge,
|
50 |
+
title={StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation},
|
51 |
+
author={Boxi Cao and Mengjie Ren and Hongyu Lin and Xianpei Han and Feng Zhang and Junfeng Zhan and Le Sun},
|
52 |
+
year={2024},
|
53 |
+
eprint={2408.03281},
|
54 |
+
archivePrefix={arXiv},
|
55 |
+
primaryClass={cs.CL},
|
56 |
+
url={https://arxiv.org/abs/2408.03281},
|
57 |
+
}
|
58 |
+
```
|