Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 25 days ago • 93
TriLMs-Unpacked Collection TriLMs unpacked to FP16 - compatible with any implementation supporting LLaMa architecture in huggingface's transformers format. • 9 items • Updated Jul 9 • 4
OpenCulture Collection A multilingual dataset of public domain books and newspapers. • 27 items • Updated 23 days ago • 117
view article Article How to train a new language model from scratch using Transformers and Tokenizers Feb 14, 2020 • 22
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 1 day ago • 346
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Sep 25 • 628
InternVL 2.0 Collection Expanding Performance Boundaries of Open-Source MLLM • 18 items • Updated 3 days ago • 81