Visual Question Answering
Transformers
Safetensors
English
videollama2_mistral
text-generation
multimodal large language model
large video-language model
Inference Endpoints
File size: 132 Bytes
e4f6f68
 
 
 
 
 
 
1
2
3
4
5
6
7
8
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "do_sample": true,
  "eos_token_id": 2,
  "transformers_version": "4.37.2"
}