Pyramidal Flow Matching for Efficient Video Generative Modeling • Paper • 2410.05954 • Published 8 days ago • 32
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction • Paper • 2410.04932 • Published 9 days ago • 8
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction • Paper • 2409.18124 • Published 20 days ago • 29
SyntheOcc: Synthesize Geometric-Controlled Street View Images through 3D Semantic MPIs • Paper • 2410.00337 • Published 15 days ago • 10