Ziwei Liu's picture

Ziwei Liu

liuziwei7

·

https://liuziwei7.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

upvoted a paper 16 days ago

authored a paper 24 days ago

Organizations

liuziwei7's activity

upvoted a paper about 6 hours ago

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published 1 day ago • 22

upvoted a paper 16 days ago

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

Paper • 2411.02336 • Published 17 days ago • 23

upvoted a paper 24 days ago

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published 28 days ago • 23

upvoted a paper about 1 month ago

Disco4D: Disentangled 4D Human Generation and Animation from a Single Image

Paper • 2409.17280 • Published Sep 25 • 9

upvoted 2 collections about 2 months ago

LMMs-Eval-Lite

Making Lite version of the dataset to accelerate holistic evaluation during model development! • 20 items • Updated Oct 4 • 1

LLaVA-OneVision

a model good at arbitrary types of visual input • 15 items • Updated Oct 5 • 20

upvoted a paper about 2 months ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3 • 37

upvoted a collection about 2 months ago

Oryx

Oryx: One Multi-Modal LLM for On-Demand Spatial-Temporal Understanding • 5 items • Updated 29 days ago • 14

upvoted 3 papers 2 months ago

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19 • 24

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion

Paper • 2409.12957 • Published Sep 19 • 18

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17 • 25

upvoted 5 papers 4 months ago

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Paper • 2408.03284 • Published Aug 6 • 10

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17 • 33

CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation

Paper • 2407.06188 • Published Jul 8 • 1

VEnhancer: Generative Space-Time Enhancement for Video Generation

Paper • 2407.07667 • Published Jul 10 • 14

upvoted a paper 5 months ago

FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models

Paper • 2406.16863 • Published Jun 24 • 10

upvoted 3 collections 5 months ago

LMMs-Eval

Dataset Collection of LMMs-Eval • 36 items • Updated Oct 4 • 25

LLaVA-Video

Models focus on video understanding (previously known as LLaVA-NeXT-Video). • 6 items • Updated Oct 5 • 53

LongVA

Long Context Transfer From Text To Vision: https://lmms-lab.github.io/posts/longva/ • 5 items • Updated Oct 4 • 12