Solar Pro Collection The most intelligent LLM on a single GPU • 3 items • Updated 23 days ago • 9
C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 4 items • Updated Aug 6 • 45
Yi 1.5 GGUFs Collection Collection of Yi 1.5 GGUFs made with gguf-my-repo • 15 items • Updated May 20 • 4
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 17 days ago • 473
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs Paper • 2402.15627 • Published Feb 23 • 33
C4AI Command R Collection C4AI Command-R is a research release of a 35 billion parameter highly performant generative model. Command-R is a large language model with open weigh • 4 items • Updated Aug 30 • 16
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 592
Frankenmodels Collection They're not supposed to be that size! Neat, right? • 8 items • Updated Dec 12, 2023 • 3