Nick's picture

2

Nick

NBStephens

·

AI & ML interests

Vision transformers, multimodal, object detection

Organizations

None yet

NBStephens's activity

upvoted 2 papers about 1 year ago

Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

Paper • 2307.06304 • Published Jul 12, 2023 • 26

Kosmos-2: Grounding Multimodal Large Language Models to the World

Paper • 2306.14824 • Published Jun 26, 2023 • 34