Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation.