Stylecodes: Encoding Stylistic Information For Image Generation Paper • 2411.12811 • Published 2 days ago • 6
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper • 2411.13503 • Published 1 day ago • 23
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents Paper • 2411.06559 • Published 11 days ago • 9
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory Paper • 2411.11922 • Published 4 days ago • 12
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration Paper • 2411.10958 • Published 5 days ago • 35
Drowning in Documents: Consequences of Scaling Reranker Inference Paper • 2411.11767 • Published 3 days ago • 16
SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers Paper • 2411.10510 • Published 6 days ago • 8
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices Paper • 2411.10640 • Published 6 days ago • 37
AnimateAnything: Consistent and Controllable Animation for Video Generation Paper • 2411.10836 • Published 5 days ago • 18
Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts Paper • 2411.10669 • Published 6 days ago • 9
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 6 days ago • 88
Can sparse autoencoders be used to decompose and interpret steering vectors? Paper • 2411.08790 • Published 8 days ago • 8
Scaling Properties of Diffusion Models for Perceptual Tasks Paper • 2411.08034 • Published 9 days ago • 13
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation Paper • 2411.07975 • Published 9 days ago • 24
Game-theoretic LLM: Agent Workflow for Negotiation Games Paper • 2411.05990 • Published 13 days ago • 6
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization Paper • 2411.06208 • Published 12 days ago • 18
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models Paper • 2411.07126 • Published 10 days ago • 28