OpenGVLab
/

InternVideo2_Chat_8B_InternLM2_5

Video-Text-to-Text

Model card Files Files and versions Community

ynhe commited on Aug 21

Commit

be1e3df

•

1 Parent(s): 6575e69

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -20,6 +20,15 @@ communicating with open-sourced LLM. In training, the video encoder will be upda
 The BaseLLM of this model is [InternLM2.5-7B](https://huggingface.co/internlm/internlm2_5-7b-chat-1m) with 1M long context window.
 ## 🚀 How to use the model
 1. make sure to have `transformers >= 4.38.0, peft==0.5.0`

 The BaseLLM of this model is [InternLM2.5-7B](https://huggingface.co/internlm/internlm2_5-7b-chat-1m) with 1M long context window.
+## 📈 Performance
+| Model |  MVBench | VideoMME(w/o sub)|
+| ---   |  ---     |   ---            |
+|InternVideo2-Chat-8B| 60.3 | 41.9    |
+|InternVideo2-Chat-8B-HD | 65.4 | 46.1|
+|InternVideo2-Chat-8B-HD-F16 | 67.5 | 49.4|
+|InternVideo2-Chat-8B-InternLM| 61.9| **49.1** |
 ## 🚀 How to use the model
 1. make sure to have `transformers >= 4.38.0, peft==0.5.0`