The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12 • 115
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Sep 25 • 624
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 166
Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese Paper • 2404.07824 • Published Apr 11 • 3
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 4 days ago • 37
NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous Driving Datasets using Markup Annotations Paper • 2312.06352 • Published Dec 11, 2023 • 1
Evaluation of Large Language Models for Decision Making in Autonomous Driving Paper • 2312.06351 • Published Dec 11, 2023 • 5
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents Paper • 2306.16527 • Published Jun 21, 2023 • 47
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs Paper • 2307.08581 • Published Jul 17, 2023 • 27