Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 3 days ago • 223
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper • 2406.08464 • Published Jun 12 • 65
Recent highlights Collection Some recent models worth checking out • 18 items • Updated 20 days ago • 41
Qwen2.5 Collection Qwen2.5 language models, with pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 370
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation Paper • 2402.16880 • Published Feb 18 • 2
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Sep 25 • 622
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27 • 148
Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • Apr 28 • 37
Honorable mentions Collection Some models I've made and liked that aren't part of a series. • 10 items • Updated Feb 4 • 6