Alpha-CLIP: A CLIP Model Focusing on Wherever You Want Paper • 2312.03818 • Published Dec 6, 2023 • 32
TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models Paper • 2311.04589 • Published Nov 8, 2023 • 18
FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search Paper • 2308.03290 • Published Aug 7, 2023 • 5
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP Paper • 2308.02487 • Published Aug 4, 2023 • 12
Unified Model for Image, Video, Audio and Language Tasks Paper • 2307.16184 • Published Jul 30, 2023 • 14
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone Paper • 2307.05463 • Published Jul 11, 2023 • 10