Multi-image Inference

#10

by annabavaresco - opened Oct 11

Oct 11

Hi, I was wondering if this version of Molmo supports multi-image inference and - if so - what's the correct way of processing the inputs. Thanks in advance!

gregjanik

Oct 12

It does not at the moment.

annabavaresco

Oct 14

I see, thanks for your reply!

Malvinan

4 days ago

As a follow up, could you explain how you evaluate on MMMU? Doesn't it contain interleaved image-text data?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment