Salman Khan

salmaneme

salmaneme's activity

upvoted 6 collections 7 months ago

Satmae++

Collection of ViT models trained using SatMAE++ approach. • 4 items • Updated Jun 11 • 1

GeoChat

GeoChat is the first grounded Large Vision Language Model, specifically tailored to Remote Sensing (RS) scenarios. • 4 items • Updated Jun 11 • 4

MobiLlama

Collection of MobiLlama Language Models. • 6 items • Updated Jun 11 • 14

GLaMM

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated. • 9 items • Updated Jun 11 • 4

Video-ChatGPT

"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. • 2 items • Updated Jun 11 • 2

LLaVA++ (LLaMA-3 and Phi-3-Mini)

Extending Visual Capabilities of LLaVA with LLaMA-3 and Phi-3 • 11 items • Updated Jun 11 • 23

upvoted a paper 9 months ago

PALO: A Polyglot Large Multimodal Model for 5B People

Paper • 2402.14818 • Published Feb 22 • 23

upvoted a paper about 1 year ago

GLaMM: Pixel Grounding Large Multimodal Model

Paper • 2311.03356 • Published Nov 6, 2023 • 33