Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
microsoft
/
OmniParser
like
834
Follow
Microsoft
4,461
Image-Text-to-Text
Transformers
Safetensors
blip-2
visual-question-answering
Inference Endpoints
arxiv:
2408.00203
License:
mit
Model card
Files
Files and versions
Community
11
Train
Deploy
Use this model
b3c8683
OmniParser
3 contributors
History:
7 commits
jacklangerman
Create LICENSE
b3c8683
verified
8 days ago
icon_caption_blip2
upload
22 days ago
icon_detect
upload
22 days ago
.gitattributes
1.52 kB
initial commit
24 days ago
LICENSE
1.11 kB
Create LICENSE
8 days ago
README.md
2.8 kB
update
9 days ago
config.json
985 Bytes
update
9 days ago