MARS: Unleashing the Power of Variance Reduction for Training Large Models Paper β’ 2411.10438 β’ Published 6 days ago β’ 11
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka β’ 2 days ago β’ 61
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion Paper β’ 2411.04928 β’ Published 14 days ago β’ 47
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper β’ 2411.10440 β’ Published 6 days ago β’ 87
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement Paper β’ 2411.06558 β’ Published 11 days ago β’ 29
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 10 items β’ Updated about 3 hours ago β’ 172
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper β’ 2411.04905 β’ Published 14 days ago β’ 108
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. β’ 9 items β’ Updated 4 days ago β’ 70
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation Paper β’ 2411.07975 β’ Published 9 days ago β’ 24
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 β’ 40 items β’ Updated 3 days ago β’ 223
Training-free Regional Prompting for Diffusion Transformers Paper β’ 2411.02395 β’ Published 17 days ago β’ 23
InstantIR: Blind Image Restoration with Instant Generative Reference Paper β’ 2410.06551 β’ Published Oct 9 β’ 6
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent Paper β’ 2411.02265 β’ Published 17 days ago β’ 24