-
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
Paper • 2306.07967 • Published • 24 -
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Paper • 2306.07954 • Published • 111 -
TryOnDiffusion: A Tale of Two UNets
Paper • 2306.08276 • Published • 71 -
Seeing the World through Your Eyes
Paper • 2306.09348 • Published • 31
Collections
Discover the best community collections!
Collections including paper arxiv:2404.11925
-
EdgeFusion: On-Device Text-to-Image Generation
Paper • 2404.11925 • Published • 20 -
Dynamic Typography: Bringing Words to Life
Paper • 2404.11614 • Published • 40 -
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Paper • 2404.07987 • Published • 46 -
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models
Paper • 2404.07724 • Published • 10
-
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Paper • 2402.04252 • Published • 21 -
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens
Paper • 2404.03413 • Published • 22 -
openai/clip-vit-large-patch14-336
Zero-Shot Image Classification • Updated • 5.8M • • 149 -
openai/clip-vit-base-patch32
Zero-Shot Image Classification • Updated • 15.7M • 411