Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
kpyu
/
video-blip-flan-t5-xl-ego4d
like
3
Image-to-Text
Transformers
PyTorch
English
blip-2
text2text-generation
vision
video-to-text
image-captioning
video-captioning
visual-question-answering
Inference Endpoints
arxiv:
2301.12597
arxiv:
2210.11416
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
video-blip-flan-t5-xl-ego4d
/
pytorch_model-00001-of-00002.bin
Commit History
Upload VideoBlipForConditionalGeneration
3366606
kpyu
commited on
May 17, 2023