OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". β’ 8 items β’ Updated 2 days ago β’ 21
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 7 items β’ Updated about 9 hours ago β’ 17
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka β’ 4 days ago β’ 75
Thinking LLMs: General Instruction Following with Thought Generation Paper β’ 2410.10630 β’ Published Oct 14 β’ 16
TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees Paper β’ 2410.12854 β’ Published Oct 10 β’ 1
view article Article Synthetic dataset generation techniques: Self-Instruct By davanstrien β’ May 15 β’ 12
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais β’ 10 days ago β’ 94
π«π· Calme-3 Collection Here you can find all the new Calme-3 models β’ 26 items β’ Updated about 3 hours ago β’ 7
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning Paper β’ 2410.02089 β’ Published Oct 2 β’ 12
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper β’ 2402.14905 β’ Published Feb 22 β’ 126
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 β’ 8 items β’ Updated 17 days ago β’ 95
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. β’ 3 items β’ Updated about 1 month ago β’ 26
AutoTrain: No-code training for state-of-the-art models Paper β’ 2410.15735 β’ Published Oct 21 β’ 57