Malav Warke

malavwarke

https://www.creatosaurus.io/

AI & ML interests

https://www.creatosaurus.io/

Recent Activity

upvoted a paper about 22 hours ago

Stylecodes: Encoding Stylistic Information For Image Generation

liked a model 4 days ago

AlonzoLeeeooo/StableV2V

upvoted a paper 4 days ago

StableV2V: Stablizing Shape Consistency in Video-to-Video Editing

View all activity

Organizations

malavwarke's activity

upvoted a paper about 22 hours ago

Stylecodes: Encoding Stylistic Information For Image Generation

Paper • 2411.12811 • Published 4 days ago • 7

upvoted a paper 4 days ago

StableV2V: Stablizing Shape Consistency in Video-to-Video Editing

Paper • 2411.11045 • Published 7 days ago • 9

upvoted a paper 9 days ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published 9 days ago • 52

upvoted a paper 18 days ago

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Paper • 2411.02397 • Published 19 days ago • 20

upvoted 2 papers about 1 month ago

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control

Paper • 2410.13830 • Published Oct 17 • 23

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Paper • 2410.08159 • Published Oct 10 • 24

upvoted 4 papers about 2 months ago

A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation

Paper • 2410.01912 • Published Oct 2 • 13

VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide

Paper • 2410.04364 • Published Oct 6 • 27

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Paper • 2409.18124 • Published Sep 26 • 31

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25 • 103

upvoted 2 papers 2 months ago

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Paper • 2409.11355 • Published Sep 17 • 28

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17 • 108

upvoted an article 6 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14

• 210

upvoted a collection 8 months ago

DBRX

Collection

DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27 • 91