Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
kpyu
/
video-blip-flan-t5-xl-ego4d
like
3
Image-to-Text
Transformers
PyTorch
English
blip-2
text2text-generation
vision
video-to-text
image-captioning
video-captioning
visual-question-answering
Inference Endpoints
arxiv:
2301.12597
arxiv:
2210.11416
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
0db71ff
video-blip-flan-t5-xl-ego4d
File size: 21 Bytes
0db71ff
1
2
3
4
---
license:
mit
---