Jianshu Zhang's picture

4 9 25

Jianshu Zhang

Sterzhang

·

https://sterzhang.github.io/

AI & ML interests

Data-Centric AI, Multi-Modal Understanding

Recent Activity

New activity 1 day ago

luoruipu1/Valley-Instruct-65k:How to download all the video?

updated a dataset 9 days ago

Sterzhang/tmp

updated a dataset 11 days ago

Sterzhang/tmp1

View all activity

Organizations

None yet

Sterzhang's activity

upvoted a paper 24 days ago

ReferEverything: Towards Segmenting Everything We Can Speak of in Videos

Paper • 2410.23287 • Published 25 days ago • 17

upvoted 3 papers 27 days ago

Can MLLMs Understand the Deep Implication Behind Chinese Images?

Paper • 2410.13854 • Published Oct 17 • 8

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control

Paper • 2410.13830 • Published Oct 17 • 23

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Paper • 2410.13863 • Published Oct 17 • 35

upvoted 2 papers 28 days ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21 • 65

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Paper • 2410.17856 • Published Oct 23 • 49

upvoted a paper about 1 month ago

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23 • 34

upvoted a paper about 2 months ago

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9 • 69

upvoted a paper 5 months ago

Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions

Paper • 2406.07502 • Published Jun 11 • 1