GPT-Generated Unified Format (GGUF) Collection ease of reading • 17 items • Updated 2 days ago • 10
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 9 items • Updated 4 days ago • 70
Llama3-8B-1.58 Collection A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! • 3 items • Updated Sep 14 • 12
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 5 hours ago • 172
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22 • 126
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 15 days ago • 95
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper • 2410.02884 • Published Oct 3 • 50
D_AU - Source files for GGUF, EXL2, AWQ, GPTQ, HQQ etc etc Collection Safetensor source files (by David_AU) to use directly and/or create different quants and/or merges. Link to GGUFS/full model card on each. • 57 items • Updated about 14 hours ago • 3
GGUF Image Model Quants Collection List of GGUF quants for text to image base models. • 9 items • Updated 23 days ago • 12
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Paper • 2404.05719 • Published Apr 8 • 80
Article MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR By abhinand • Oct 20 • 30
Article Advanced Flux Dreambooth LoRA Training with 🧨 diffusers By linoyts • Oct 21 • 27
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 17 days ago • 89