Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 • 67
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 7 days ago • 24
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 46
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25 • 86
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Paper • 2405.18392 • Published May 28 • 12
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 80 items • Updated 4 days ago • 90
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 17 items • Updated Jun 6 • 231