Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation Paper • 2410.00890 • Published 1 day ago • 7
Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors Paper • 2409.17058 • Published 7 days ago • 9
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published 7 days ago • 87
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions Paper • 2409.18042 • Published 6 days ago • 30
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 11 items • Updated 7 days ago • 311
view article Article 🌟 Easy Fine-Tuning with Hugging Face SQL Console, Notebook Creator, and SFT By asoria • 8 days ago • 12
OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs Paper • 2409.05152 • Published 24 days ago • 29
view article Article ⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2 By burtenshaw • Jun 3 • 26
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22 • 110
view article Article Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models By isidentical • Aug 26 • 34
Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference Paper • 2202.10408 • Published Feb 21, 2022 • 5
view article Article dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified By chansung • Aug 22 • 12
view article Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14 • 44
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities Paper • 2305.11172 • Published May 18, 2023 • 1
view article Article The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models Jan 29 • 14
Parler-TTS: fully open-source high-quality TTS Collection If you want to find out more about how these models were trained and even fine-tune them yourself, check-out the Parler-TTS repository on GitHub. • 7 items • Updated Aug 8 • 43
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models Paper • 2408.02718 • Published Aug 5 • 60
view article Article Querying Datasets with the Datasets Explorer Chrome Extension By cfahlgren1 • Jul 19 • 6
BRAG-v0.1 Collection BRAG is a series of SLMs (Small Language Models) specifically trained for RAG tasks. We release models with size 1.5b, 7b and 8b. • 4 items • Updated Aug 4 • 13
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA May 24, 2023 • 82
Gemma 2: Improving Open Language Models at a Practical Size Paper • 2408.00118 • Published Jul 31 • 73
Bilateral Reference for High-Resolution Dichotomous Image Segmentation Paper • 2401.03407 • Published Jan 7 • 1
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement Paper • 2408.00653 • Published Aug 1 • 27
view article Article BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18 • 35
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data Paper • 2401.10891 • Published Jan 19 • 58