view article Article PyTorchModelHubMixin: Bridging the Gap for Custom AI Models on Hugging Face By not-lain • 8 days ago • 11
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated about 21 hours ago • 207
view article Article Recipe: Preparing Multilingual Speech Datasets for TTS Training By PHBJT • 15 days ago • 14
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 12 days ago • 95
view article Article How to optimize your data labelling project with custom interfaces By burtenshaw • Oct 16 • 18
view article Article Synthetic dataset generation techniques: Self-Instruct By davanstrien • May 15 • 12
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated Oct 1 • 41
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 45
Papers about model merging Collection referenced in the mergekit repo: https://github.com/cg123/mergekit • 4 items • Updated Feb 13 • 14
view article Article DuckDB: run SQL queries on 50,000+ datasets on the Hugging Face Hub Jun 7, 2023 • 4