9 11 10

Xiangtai Li

LXT

https://lxtgh.github.io/

xtl994
lxtGH

AI & ML interests

Computer Vision, Multi-Modal Understanding, Generative AI

Recent Activity

liked a model 19 days ago

Collov-Labs/Monetico

upvoted a paper about 1 month ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

upvoted a paper about 1 month ago

Movie Gen: A Cast of Media Foundation Models

View all activity

Organizations

LXT's activity

liked a model 19 days ago

Collov-Labs/Monetico

Text-to-Image • Updated 26 days ago • 5.35k • 64

upvoted 2 papers about 1 month ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17 • 27

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17 • 88

liked a Space about 1 month ago

Running on Zero

🚀

Meissonic Flow

authored 6 papers about 1 month ago

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Paper • 2403.12003 • Published Mar 18 • 2

LLAVADI: What Matters For Multimodal Large Language Models Distillation

Paper • 2407.19409 • Published Jul 28

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10 • 49

upvoted a paper about 1 month ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10 • 49

commented a paper about 1 month ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10 • 49 •

liked a model about 1 month ago

MeissonFlow/Meissonic

Text-to-Image • Updated 26 days ago • 1.82k • 96

updated a model 4 months ago

OMG-Research/OMG-LLaVA

Updated Aug 6 • 1

liked 2 models 4 months ago

zhangtao-whu/OMG-LLaVA

Updated Jul 3 • 4

PhoenixZ/MG-LLaVA

Updated Jun 26 • 7

upvoted a paper 4 months ago

Video Diffusion Alignment via Reward Gradients

Paper • 2407.08737 • Published Jul 11 • 47

updated a Space 5 months ago

No application file

📈

OMG LLaVA

commented 2 papers 5 months ago

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Paper • 2406.19389 • Published Jun 27 • 51 •

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Paper • 2406.19389 • Published Jun 27 • 51 •