Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages Paper • 2411.12240 • Published 23 days ago • 6
LLäMmlein: Compact and Competitive German-Only Language Models from Scratch Paper • 2411.11171 • Published 25 days ago • 8
Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement Paper • 2412.04003 • Published 7 days ago • 9