CLAIR-A: Leveraging Large Language Models to Judge Audio Captions Paper • 2409.12962 • Published Sep 19 • 2
Visual Haystacks: Answering Harder Questions About Sets of Images Paper • 2407.13766 • Published Jul 18 • 2 • 4
Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition Paper • 2403.19822 • Published Mar 28
Virtual Personas for Language Models via an Anthology of Backstories Paper • 2407.06576 • Published Jul 9
Post: 🚨 Launching The Visual Haystacks (VHs) Benchmark: the first "visual-centric" Needle-In-A-Haystack (NIAH) benchmark to assess LMMs' capability in long-context visual retrieval and reasoning. Check it out!
Dataset: tsunghanwu/visual_haystacks
Project page: https://visual-haystacks.github.io/
Paper: https://arxiv.org/abs/2407.13766
Code: https://github.com/visual-haystacks/vhs_benchmark
Article: Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! By davidchan • Jul 23 • 3
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition Paper • 2401.02417 • Published Jan 4 • 1
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification Paper • 2312.14378 • Published Dec 22, 2023