94 25 20

Wenhu Chen

wenhu

https://wenhuchen.github.io

AI & ML interests

NLP

Recent Activity

updated a Space 4 days ago

TIGER-Lab/MEGA-Bench

updated a dataset 4 days ago

TIGER-Lab/MEGA-Bench

New activity 6 days ago

TIGER-Lab/Fineweb-Instruct:[bot] Conversion to Parquet

View all activity

Organizations

wenhu's activity

upvoted a paper 12 days ago

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published 13 days ago • 43

upvoted 2 papers about 1 month ago

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14 • 37

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks

Paper • 2410.05160 • Published Oct 7 • 4

upvoted a paper 3 months ago

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26 • 42

upvoted 2 papers 5 months ago

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published Jun 21 • 61

Unifying Multimodal Retrieval via Document Screenshot Embedding

Paper • 2406.11251 • Published Jun 17 • 9

upvoted 3 papers 6 months ago

GenAI Arena: An Open Evaluation Platform for Generative Models

Paper • 2406.04485 • Published Jun 6 • 20

T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback

Paper • 2405.18750 • Published May 29 • 21

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Paper • 2406.01574 • Published Jun 3 • 43

upvoted a paper 7 months ago

MANTIS: Interleaved Multi-Image Instruction Tuning

Paper • 2405.01483 • Published May 2 • 6

upvoted 3 papers 8 months ago

upvoted a collection 9 months ago

StructLM

Collection

The structure knowledge grounded language model • 6 items • Updated Apr 6 • 7

upvoted 2 papers 9 months ago

ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25 • 56

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Paper • 2402.16671 • Published Feb 26 • 26

upvoted 3 papers 10 months ago

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

Paper • 2402.04324 • Published Feb 6 • 23

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

Paper • 2401.11944 • Published Jan 22 • 27

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

Paper • 2401.06951 • Published Jan 13 • 25

upvoted a paper 11 months ago

Instruct-Imagen: Image Generation with Multi-modal Instruction

Paper • 2401.01952 • Published Jan 3 • 30