DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. β’ 2 items β’ Updated 1 day ago β’ 39
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper β’ 2409.01704 β’ Published 11 days ago β’ 64
Qwen2-VL Collection Vision-language model series based on Qwen2 β’ 11 items β’ Updated 9 days ago β’ 98
Enhance Your Images Collection Some trending Gradio apps on Spaces that you can use to enhance/upscale your images for free. This collection will be kept uptodate with new releases. β’ 7 items β’ Updated 22 days ago β’ 17
Jamba-1.5 Collection The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models β’ 2 items β’ Updated 23 days ago β’ 70
XGen-MM-1 models and datasets Collection A collection of all XGen-MM (Foundation LMM) models! β’ 13 items β’ Updated 2 days ago β’ 33
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper β’ 2408.08872 β’ Published 28 days ago β’ 96
π¦ π FalconMamba 7B Collection This collection features the FalconMamba 7B base model, the instruction-tuned version, their 4-bit and GGUF variants, and the demo. β’ 13 items β’ Updated 24 days ago β’ 25
Qwen2-Audio Collection Audio-language model series based on Qwen2 β’ 4 items β’ Updated Aug 9 β’ 37
Qwen2-Math Collection Math-specific model series based on Qwen2 β’ 7 items β’ Updated about 22 hours ago β’ 40
VideoLLaMA 2 Collection Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability β’ 11 items β’ Updated 14 days ago β’ 17
π MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" β’ 13 items β’ Updated Jul 24 β’ 49
Llama 3.1 Collection This collection hosts the transformers and original repos of the Meta Llama 3.1, Llama Guard 3 and Prompt Guard models β’ 11 items β’ Updated Aug 2 β’ 552
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation β’ 6 items β’ Updated 23 days ago β’ 53
emπing series Collection crispy sentence embedding family β’ 4 items β’ Updated 23 days ago β’ 20
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize β’ 6 items β’ Updated Jul 21 β’ 53
πͺ SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos β’ 12 items β’ Updated 27 days ago β’ 168
ColPali Paper Resources Collection Main resources for the paper: "ColPali: Efficient Document Retrieval with Vision Language Models" β’ 3 items β’ Updated Jul 2 β’ 4
InternVL 2.0 Collection Expanding Performance Boundaries of Open-Source MLLM β’ 16 items β’ Updated Aug 10 β’ 69
Probably function calling datasets Collection Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. β’ 39 items β’ Updated Jul 17 β’ 35
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. β’ 4 items β’ Updated Jun 27 β’ 147
view article Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*β‘ By xhluca β’ Jul 9 β’ 34
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24 β’ 164
The Prompt Report: A Systematic Survey of Prompting Techniques Paper β’ 2406.06608 β’ Published Jun 6 β’ 52
SSMs Collection A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. β’ 5 items β’ Updated Jul 17 β’ 23
C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. β’ 4 items β’ Updated Aug 6 β’ 45
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 27 items β’ Updated 1 day ago β’ 457
Yi 1.5 GGUFs Collection Collection of Yi 1.5 GGUFs made with gguf-my-repo β’ 15 items β’ Updated May 20 β’ 4
MAmmoTH2 Collection Scaling up instruction data from the web for to build better LLMs β’ 11 items β’ Updated May 26 β’ 8
Searching for Better ViT Baselines Collection Exploring ViT hparams and model shapes for the GPU poor (between tiny and base). β’ 25 items β’ Updated 23 days ago β’ 12
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma β’ 16 items β’ Updated Jul 31 β’ 133
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Aug 2 β’ 671
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin β’ 63 items β’ Updated Apr 17 β’ 55