view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 8 days ago • 94
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5 • 161
LangBridge: Multilingual Reasoning Without Multilingual Supervision Paper • 2401.10695 • Published Jan 19 • 5
view article Article OCR Processing and Text in Image Analysis with Florence-2-base and Qwen2-VL-2B By PandorAI1995 • Oct 18 • 13
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled • Oct 14 • 55
EU20-Benchmarks Collection Evaluation Benchmarks for 20 European languages. • 5 items • Updated Oct 11 • 4
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching Paper • 2410.06885 • Published Oct 9 • 40
view article Article Improving performance with Arena Learning in post training By satpalsr • Sep 11 • 5
view article Article Perspectives for first principles prompt engineering By KnutJaegersberg • Aug 18 • 16