Collections

Discover the best community collections!

Collections including paper arxiv:2310.03744
Vision Language Models Papers 🖼️💬📝
Papers about vision-language models, most important ones are on top of the list.
Multimodal
Collection by 19 days ago
Multimodal Papers
Collection by Apr 22
LLaVa-NeXT
LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets.