VLLMs

LLaVA-OneVision: Easy Visual Task Transfer
Paper • 2408.03326 • Published Aug 6 • 59

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Paper • 2408.02718 • Published Aug 5 • 60

NVLM: Open Frontier-Class Multimodal LLMs
Paper • 2409.11402 • Published 15 days ago • 65