Open-Sora Plan: Open-Source Large Video Generation Model Paper • 2412.00131 • Published 6 days ago • 22
Image / Video Gen Collection Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 25 items • Updated 1 day ago • 6
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper • 2410.10792 • Published Oct 14 • 27
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory Paper • 2411.11922 • Published 16 days ago • 17
OminiControl: Minimal and Universal Control for Diffusion Transformer Paper • 2411.15098 • Published 12 days ago • 42
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published 12 days ago • 55
Multimodal Language Model Collection What matters besides the data recipe when training a multimodal language model? • 25 items • Updated 11 days ago • 1
Multimodal Autoregressive Pre-training of Large Vision Encoders Paper • 2411.14402 • Published 13 days ago • 40