portuguese-benchmark-datasets/BLUEX
Viewer
•
Updated
•
1.26k
•
177
•
11
Benchmark datasets for evaluating language models in Portuguese and assessing their knowledge about Brazil
We currently offer one dataset for evaluating the performance of language models on Brazilian Leading Universities Entrance eXams (BLUEX). If you use this dataset for research, please cite the paper:
@misc{almeida2023bluex,
title={BLUEX: A benchmark based on Brazilian Leading Universities Entrance eXams},
author={Thales Sales Almeida and Thiago Laitz and Giovana K. Bonás and Rodrigo Nogueira},
year={2023},
eprint={2307.05410},
archivePrefix={arXiv},
primaryClass={cs.CL}
}