open_ita_llm_leaderboard / leaderboard_general.csv
FinancialSupport's picture
Update leaderboard_general.csv
d9b5522 verified
raw
history blame
1.68 kB
model ,m_mmlu_it acc shot 3,m_mmlu_it acc shot 5,m_mmlu_it acc shot 0,belebele_ita_Latn acc,belebele_ita_Latn acc norm,helloswag_it acc,helloswag_it acc norm,lambada_openai_mt_it perplexity,lambada_openai_mt_it acc,xcopa_it acc,arc_it acc,arc_it acc norm
giux78/zefiro-7b-sft-qlora-ITA-v0.5,0.5196,0.5246,0.4762,0.4656,0.4656,0.4636,0.6097,22.5232,0.5154,0.67,0.1642,0.4397
mii-llm/maestrale-chat-v0.2-alpha,0.519,0.5163,0.4682,0.4678,0.4678,0.519,0.6852,26.0037,0.4987,0.722,0.1206,0.4585
FinancialSupport/saiga-7b,0.4973,0.4933,0.4982,0.5222,0.5222,0.4824,0.6342,30.2369,0.4671,0.672,0.16,0.4748
giux78/zefiro-7b-beta-ITA-v0.1,0.5297,0.5203,0.4716,0.45,0.45,0.4607,0.6129,25.8213,0.5013,0.666,0.0838,0.4294
raicritis/Hermes7b_ITA,,0.3574,0.3381,0.3689,0.3689,0.4112,0.5407,34.7106,0.4677,0.66,0.1249,0.3524
DeepMount/Mistral-Ita-7b,,0.3879,0.3538,0.38,0.38,0.3978,0.5123,89.99,0.3361,0.592,0,0.3747
galatolo/cerbero-7B,,0.5137,0.4867,0.5089,0.5089,0.4722,0.6135,23.4551,0.4964,0.672,0.1001,0.4465
mii-11m/maestrale-chat-v0.3-alpha,,0.5164,0.4774,0.5911,0.5911,0.5046,0.66,38.2427,0.4378,0.692,0.1343,0.4568
giux78/zefiro-7b-dpo-qlora-ITA-v0.7,0.508,0.5203,0.4717,0.4778,0.4778,0.4914,0.6428,23.6041,0.5174,0.684,0.1805,0.4611
mii-llm/maestrale-chat-v0.3-beta,,0.5129,,0.5644,0.5644,0.5067,0.6581,53.0646,0.4207,0.72,0.1463,0.4559
swap-uniba/LLaMAntino-2-7b-hf-ITA,,0.3696,,0.2433,0.2433,0.4113,0.5428,33.6146,0.4696,0.678,0.139,0.3456
mistralai/Mistral-7B-v0.1,,0.5253,,0.41,0.41,0.4486,0.6122,30.2635,0.4894,0.658,0.1061,0.4149
swap-uniba/LLaMAntino-2-70b-hf-UltraChat-ITA,,0.5991,,,,0.5027,0.6506,,,,0.2464,0.4953
MoxoffSpA/Azzurro,,0.5084,,,,0.5027,0.6074,,,,0.1497,0.4414