openGPT-X/Teuken-7B-instruct-commercial-v0.4 Text Generation • Updated about 4 hours ago • 809 • 19
Marqo-Ecommerce-Embeddings Collection State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP. • 10 items • Updated 13 days ago • 16
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated about 5 hours ago • 179
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated about 16 hours ago • 97
meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8 Text Generation • Updated 9 days ago • 589 • 27
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 23 days ago • 92
LayerSkip Collection Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated 6 days ago • 43