ByteDance
/

shot2story

Visual Question Answering

Model card Files Files and versions Community

mhan commited on Apr 25

Commit

5bd2e88

•

1 Parent(s): c13efb5

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -13,12 +13,12 @@ pipeline_tag: visual-question-answering
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/641ae9911911d3be67422e6f/0KwEa8cvg0KEq7wLmhpLz.png)
-## Dataset Description
 - **Repository:** [Shot2Story](https://github.com/bytedance/Shot2Story)
 - **Paper:** [2312.10300](https://arxiv.org/abs/2312.10300)
 - **Point of Contact:** mailto:[Mingfei Han]([email protected])
 **For video data downloading, please have a look at [this issue](https://github.com/bytedance/Shot2Story/issues/5).**
 We are excited to release a new video-text benchmark for multi-shot video understanding. This release contains a 134k version of our dataset. It includes detailed long summaries (human annotated + GPTV generated) for 134k videos and shot captions (human annotated) for 188k video shots. Please check the dataset [here](https://huggingface.co/datasets/mhan/Shot2Story-134K).

 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/641ae9911911d3be67422e6f/0KwEa8cvg0KEq7wLmhpLz.png)
 - **Repository:** [Shot2Story](https://github.com/bytedance/Shot2Story)
 - **Paper:** [2312.10300](https://arxiv.org/abs/2312.10300)
 - **Point of Contact:** mailto:[Mingfei Han]([email protected])
+## Training Dataset
 **For video data downloading, please have a look at [this issue](https://github.com/bytedance/Shot2Story/issues/5).**
 We are excited to release a new video-text benchmark for multi-shot video understanding. This release contains a 134k version of our dataset. It includes detailed long summaries (human annotated + GPTV generated) for 134k videos and shot captions (human annotated) for 188k video shots. Please check the dataset [here](https://huggingface.co/datasets/mhan/Shot2Story-134K).