Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models Paper • 2407.01906 • Published Jul 2 • 34
Code Llama Family Collection This collection hosts the transformers repos of the Code Llama release • 12 items • Updated Sep 25 • 39
🔍 Daily Picks in Interpretability & Analysis of LMs Collection Outstanding research in interpretability and evaluation of language models, summarized • 82 items • Updated 4 days ago • 91
Image-to-Text Models 📝 Collection This collection contains image captioning and OCR models. • 15 items • Updated Sep 19, 2023 • 5
Enhancing Vision-Language Pre-training with Rich Supervisions Paper • 2403.03346 • Published Mar 5 • 14
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference Paper • 2403.04132 • Published Mar 7 • 38