SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration Paper • 2411.10958 • Published 5 days ago • 40
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization Paper • 2411.11909 • Published 5 days ago • 18
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 3 days ago • 40
Drowning in Documents: Consequences of Scaling Reranker Inference Paper • 2411.11767 • Published 4 days ago • 16
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 7 days ago • 90
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published 10 days ago • 59
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published 15 days ago • 108
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective Paper • 2410.23743 • Published 22 days ago • 59
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published about 1 month ago • 88
Aligning Large Language Models via Self-Steering Optimization Paper • 2410.17131 • Published about 1 month ago • 21
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens Paper • 2410.13863 • Published Oct 17 • 35
WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents Paper • 2410.07484 • Published Oct 9 • 48
TLDR: Token-Level Detective Reward Model for Large Vision Language Models Paper • 2410.04734 • Published Oct 7 • 16