13 14 6

Weihao Yu

whyu

https://scholar.google.com/citations?user=LYxjt1QAAAAJ

AI & ML interests

Computer Vision, NLP and AI

Recent Activity

upvoted a paper 3 days ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

New activity 24 days ago

whyu/MM-Vet_Evaluator:Inquiry About Model API for Answer Post-Processing

upvoted an article about 1 month ago

Mamba Out

View all activity

Organizations

whyu's activity

upvoted a paper 3 days ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published 5 days ago • 37

upvoted an article about 1 month ago

Article

Mamba Out

•

Oct 18

• 8

upvoted a paper about 1 month ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14 • 14

upvoted a paper about 2 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 166

upvoted a paper 2 months ago

Attention Prompting on Image for Large Vision-Language Models

Paper • 2409.17143 • Published Sep 25 • 7

upvoted 3 papers 3 months ago

upvoted a paper 4 months ago

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Paper • 2408.00765 • Published Aug 1 • 12

upvoted an article 4 months ago

Article

MobileNet Baselines

•

Jul 26

• 23

upvoted a paper 4 months ago

KAN or MLP: A Fairer Comparison

Paper • 2407.16674 • Published Jul 23 • 42

upvoted 2 papers 5 months ago

Compositional Video Generation as Flow Equalization

Paper • 2407.06182 • Published Jun 10 • 12

Video-Infinity: Distributed Long Video Generation

Paper • 2406.16260 • Published Jun 24 • 28

upvoted a paper about 1 year ago

Exponentially Faster Language Modelling

Paper • 2311.10770 • Published Nov 15, 2023 • 118