Investing in Performance: Fine-tune small models with LLM insights - a CFM case study about 17 hours ago โข 5
Banque des Territoires (CDC Group) x Polyconseil x Hugging Face: Enhancing a Major French Environmental Program with a Sovereign Data Solution Jul 9 โข 5
view article Article Banque des Territoires (CDC Group) x Polyconseil x Hugging Face: Enhancing a Major French Environmental Program with a Sovereign Data Solution Jul 9 โข 5
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate Jun 13 โข 44
๐ซ๐ท Cross-encoder rerankers Collection A collection of cross-encoder reranking models in French. โข 31 items โข Updated Oct 4 โข 7
DiscoLeo 8B: Llama3 for German Collection Continued Pretraining on Llama3 8B to improve German linguistic capabilities. A collection of base and fine-tuned models and variants. โข 5 items โข Updated May 25 โข 16
Albert Collection Les diffรฉrents modรจles ร jour dans la famille Albert, les modรจles archivรฉs n'apparaissent pas dans cette collection. The various models behind Albert โข 14 items โข Updated Oct 16 โข 7
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases โข 5 items โข Updated Sep 25 โข 685
MoAI: Mixture of All Intelligence for Large Language and Vision Models Paper โข 2403.07508 โข Published Mar 12 โข 75
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper โข 2402.17764 โข Published Feb 27 โข 603
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper โข 2312.00752 โข Published Dec 1, 2023 โข 138
FrenchBench Evaluation datasets Collection These datasets are used to evaluate models on French performance using: https://github.com/EleutherAI/lm-evaluation-harness (from CroissantLLM paper) โข 11 items โข Updated Jun 7 โข 4
ML for Tools Collection Collection of papers about ML for using tools! โข 25 items โข Updated Jan 17 โข 9