OpenGVLab/InternVL-14B-224px

czczup committed on Feb 11

Commit d76eb45 • 1 Parent(s): c4036dd

Update README.md

Files changed (1)

README.md +10 -0

@@ -30,6 +30,16 @@ It is _**the largest open-source vision/vision-language foundation model (14B)**
   - Image size: 224 x 224
 - **Pretrain Dataset:** LAION-en, LAION-COCO, COYO, CC12M, CC3M, SBU, Wukong, LAION-multi
 ## Model Usage
 ```python

   - Image size: 224 x 224
 - **Pretrain Dataset:** LAION-en, LAION-COCO, COYO, CC12M, CC3M, SBU, Wukong, LAION-multi
+## Zero-Shot Performance
+See this [document](https://github.com/OpenGVLab/InternVL/tree/main) for more details about the zero-shot evaluation.
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/KfsrXioPU77T48sRb60oL.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/q5UkfrEix6w3mnn_1w4ja.png)
 ## Model Usage
 ```python