view article Article Improving performance with Arena Learning in post training By satpalsr • Sep 11 • 5
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22 • 118
view article Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14 • 50
view article Article Outperforming Claude 3.5 Sonnet with Phi-3-mini-4k for graph entity relationship extraction tasks By rcaulk • Aug 19 • 7
Compact Language Models via Pruning and Knowledge Distillation Paper • 2407.14679 • Published Jul 19 • 38
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools Paper • 2406.12793 • Published Jun 18 • 31
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 • 159
view article Article Introducing the Hugging Face LLM Inference Container for Amazon SageMaker May 31, 2023 • 2
view article Article ⚗️ 🧑🏼🌾 Let's grow some Domain Specific Datasets together By burtenshaw • Apr 29 • 29
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment Paper • 2401.12474 • Published Jan 23 • 35