---
license: apache-2.0
inference: false
datasets:
- Lin-Chen/ShareGPT4V
- ShareGPT4Video/ShareGPT4Video
pipeline_tag: visual-question-answering
---
<br>
<br>
# sharegpt4video-8b Model Card
## Model details
**Model type:**
sharegpt4video-8b is an open-source video chatbot trained by fine-tuning the entire model on open-source [video instruction data](https://huggingface.co/datasets/ShareGPT4Video/ShareGPT4Video). Training takes around 5 hours on 32 A100 GPUs (roughly 160 GPU-hours).
**Model date:**
sharegpt4video-8b was trained in May 2024.
**Paper or resources for more information:**
[[Code](https://github.com/ShareGPT4Omni/ShareGPT4Video)] [[Project Page](https://sharegpt4video.github.io/)]
## Usage
To use this model, follow the instructions in our [[repository](https://github.com/ShareGPT4Omni/ShareGPT4Video)].
## Training dataset
All training data are open source; see our [repository](https://github.com/ShareGPT4Omni/ShareGPT4Video) for how to use them.
- 153K collection of various video instruction data
- 28K high-quality video caption data from [[ShareGPT4Video](https://huggingface.co/datasets/ShareGPT4Video/ShareGPT4Video)]
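As a rough illustration of the mixture listed above, the short sketch below computes the total size and each source's share. It treats the two figures as item counts in thousands, which is an assumption about their unit; the dictionary keys and the `mixture_shares` helper are hypothetical names introduced only for this example and are not part of the repository.

```python
# Counts as stated in this model card, in thousands of items.
# The key names are illustrative labels, not identifiers from the repository.
TRAINING_MIX_K = {
    "video_instruction": 153,       # collection of various video instruction data
    "sharegpt4video_caption": 28,   # high-quality captions from ShareGPT4Video
}


def mixture_shares(counts):
    """Return each source's fraction of the total item count."""
    total = sum(counts.values())
    return {name: count / total for name, count in counts.items()}


total_k = sum(TRAINING_MIX_K.values())
shares = mixture_shares(TRAINING_MIX_K)

print(f"total: ~{total_k}K items")
for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```

Under these assumptions the mixture comes to about 181K items, of which the caption data makes up roughly 15%.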
## Intended use
**Primary intended uses:**
The primary use of sharegpt4video-8b is research on large video-language models and video chatbots.
**Primary intended users:**
The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.
## Paper
https://arxiv.org/abs/2406.04325