Article Improving Hugging Face Training Efficiency Through Packing with Flash Attention 24 days ago • 19
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval Paper • 2407.19669 • Published Jul 29 • 17
GTE models Collection General Text Embedding Models Released by Alibaba Group • 19 items • Updated Aug 6 • 9
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 35 items • Updated Aug 8 • 325
Nomic Embed Vision Collection Vision Encoders aligned to Nomic Embed Text, making Nomic Embed multimodal! • 2 items • Updated Jun 5 • 5