Pspets multimodal GPT-4V(ision) is a Generalist Web Agent, if Grounded Paper • 2401.01614 • Published Jan 3 • 21
MultiModal Models Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 84
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 84