FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model Paper • 2410.13925 • Published 15 days ago • 21
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities Paper • 2410.14672 • Published 14 days ago • 7
Scalable Ranked Preference Optimization for Text-to-Image Generation Paper • 2410.18013 • Published 9 days ago • 14
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation Paper • 2410.18666 • Published 8 days ago • 17