Yuxuan Wang's picture

5 4 2

Yuxuan Wang

ColorfulAI

·

https://patrick-tssn.github.io/

patrick-tssn

AI & ML interests

Multimodal Learning

Recent Activity

commented a paper 15 days ago

VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format

upvoted a paper 15 days ago

VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format

updated a model about 2 months ago

ColorfulAI/NeedleInAVideoHaystack

View all activity

Organizations

Papers 7

arxiv:2411.17991

arxiv:2409.01151

arxiv:2409.01071

arxiv:2408.02210

models 4

ColorfulAI/NeedleInAVideoHaystack

ColorfulAI/videollamb-llava-1.5-7b

Video-Text-to-Text • Updated Sep 9 • 20 • 4

ColorfulAI/videollamb-mem-llava-1.5-7b

Updated Aug 12 • 7

ColorfulAI/LSTP-Chat

Image-Text-to-Text • Updated Aug 2 • 4

datasets 3

ColorfulAI/NeedleInAVideoHaystack

Updated Oct 16 • 5

ColorfulAI/EgoPlan_test

Viewer • Updated Sep 15 • 923 • 155

ColorfulAI/VideoLLaMB-IT

Viewer • Updated Aug 12 • 1.03M • 49