Visual Question Answering
GuanacoVQA / README.md
JosephusCheung's picture
Update README.md
01875b9
|
raw
history blame
622 Bytes
metadata
license: gpl-3.0
datasets:
  - JosephusCheung/GuanacoVQADataset
language:
  - en
  - zh
  - ja
  - de
pipeline_tag: visual-question-answering

The following content is currently a work in progress and does not represent the final quality.

Alignment for the multilingual VQA tasks is being conducted on blip2-flan-t5-xxl and Guanaco using only Linear Layers.

The latest weight file is provided here, based on the implementation of MiniGPT-4.

This model supports English, Chinese, Japanese, and German languages and requires the combined use of the Guanaco 7B LLM model.

A portion of the dataset has already been released.