List of leaderboard's and resources that I developed to evaluate LLM's in a Multilingual and Portuguese settings.
Eduardo Garcia
eduagarcia
AI & ML interests
Master's Student of Computer Science at UFG. AI Researcher on CEIA and Data Lawyer.
Organizations
Collections
3
A daily uploaded list of models with best evaluations on the PT-LLM leaderboard:
-
recogna-nlp/phibode_1_5_ultraalpaca
Text Generation β’ Updated β’ 45 β’ 3 -
h2oai/h2o-danube2-1.8b-base
Text Generation β’ Updated β’ 1.1k β’ 46 -
stabilityai/stablelm-2-zephyr-1_6b
Text Generation β’ Updated β’ 12.4k β’ 179 -
h2oai/h2o-danube3-4b-base
Text Generation β’ Updated β’ 7.25k β’ 19
models
2
datasets
18
eduagarcia/LegalPT_dedup
Viewer
β’
Updated
β’
23.9M
β’
17
β’
13
eduagarcia/LegalPT
Viewer
β’
Updated
β’
48.4M
β’
26
β’
8
eduagarcia/portuguese_benchmark
Viewer
β’
Updated
β’
78.1k
β’
370
β’
5
eduagarcia/FactNews
Viewer
β’
Updated
β’
22.1k
β’
6
β’
1
eduagarcia/sick-br
Viewer
β’
Updated
β’
9.84k
β’
8
β’
2
eduagarcia/dpo-mix-21k
Viewer
β’
Updated
β’
21k
β’
2
eduagarcia/pagico
Viewer
β’
Updated
β’
970k
β’
5
eduagarcia/MilkQA
Viewer
β’
Updated
β’
7.97k
β’
4
eduagarcia/tweetsentbr_fewshot
Viewer
β’
Updated
β’
2.09k
β’
157
eduagarcia/CrawlPT_dedup
Viewer
β’
Updated
β’
105M
β’
5
β’
4