TIGER-Lab/Mantis-8B-Idefics2
Updated
•
4.12k
•
10
Mantis model family optimized for multi-image reasoning with interleaved text/image format
Note Current SoTA Mantis variant
Note Current SoTA Mantis variant without multi-image pre-training
Note Our training dataset
Note Curated evaluation benchmark for multi-image scenarios
Multimodal Language Model