Flo Schneider's picture

Flo Schneider

floschne

·

https://www.inf.uni-hamburg.de/en/inst/ab/lt/people/florian-schneider.html

floschne

AI & ML interests

Multi Modal Information Retrieval and Representation Learning

Recent Activity

New activity 22 days ago

neulab/PangeaBench-xmmmu

upvoted a paper about 1 month ago

Organizations

floschne's activity

upvoted a paper about 1 month ago

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8 • 107

upvoted a collection 2 months ago

LLaVA-Onevision

LLaVa_Onevision models for single-image, multi-image, and video scenarios • 9 items • Updated Sep 18 • 12

upvoted an article 2 months ago

Article

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

Aug 22, 2023

• 27

upvoted a paper 2 months ago

M5 -- A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks

Paper • 2407.03791 • Published Jul 4 • 1

upvoted a paper 3 months ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

upvoted a paper 5 months ago

What If We Recaption Billions of Web Images with LLaMA-3?

Paper • 2406.08478 • Published Jun 12 • 39

upvoted a paper 6 months ago

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27 • 31

upvoted 2 papers 8 months ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 124

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Paper • 2403.18814 • Published Mar 27 • 44