Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free • Paper • 2410.10814 • Published Oct 14 • 48
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis • Paper • 2410.08261 • Published Oct 10 • 49
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation • Paper • 2410.08159 • Published Oct 10 • 24
EvTexture: Event-driven Texture Enhancement for Video Super-Resolution • Paper • 2406.13457 • Published Jun 19 • 16
Improving Generative Adversarial Networks for Video Super-Resolution • Paper • 2406.16359 • Published Jun 24 • 1
Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors • Paper • 2407.09919 • Published Jul 13 • 1
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control • Paper • 2407.12781 • Published Jul 17 • 12
PALP: Prompt Aligned Personalization of Text-to-Image Models • Paper • 2401.06105 • Published Jan 11 • 47
VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning • Paper • 2311.00990 • Published Nov 2, 2023 • 2
High-fidelity Person-centric Subject-to-Image Synthesis • Paper • 2311.10329 • Published Nov 17, 2023 • 1
HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models • Paper • 2312.00079 • Published Nov 30, 2023 • 14