Image-to-text models Collection of image captioning models Salesforce/blip-image-captioning-large Image-to-Text • Updated Dec 7, 2023 • 1.03M • 923 microsoft/git-large-coco Image-to-Text • Updated Jun 26, 2023 • 5.12k • 93 Salesforce/instructblip-vicuna-7b Image-to-Text • Updated Apr 12 • 241k • 74 Salesforce/blip2-flan-t5-xxl Image-to-Text • Updated Mar 29 • 8.31k • 80
SigLIP release SigLIP improves upon CLIP with a sigmoid loss. Both English-only and multilingual checkpoints are released. Sigmoid Loss for Language Image Pre-Training Paper • 2303.15343 • Published Mar 27, 2023 • 4 google/siglip-base-patch16-224 Zero-Shot Image Classification • Updated Jan 19 • 152k • 11 google/siglip-base-patch16-256 Zero-Shot Image Classification • Updated Jan 19 • 3.17k google/siglip-base-patch16-384 Zero-Shot Image Classification • Updated Jan 19 • 1.39k • 8