Visual Question Answering
Transformers
English
qwen2
text-generation
multimodal large language model
large video-language model
Inference Endpoints

Commit History

initial commit
6683b34
verified

lixin4ever commited on