RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 3 days ago • 40
ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction? Paper • 2411.06469 • Published 12 days ago • 17
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published 11 days ago • 28
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Paper • 2411.04282 • Published 15 days ago • 30
Survey of Cultural Awareness in Language Models: Text and Beyond Paper • 2411.00860 • Published 23 days ago • 23
LOGO -- Long cOntext aliGnment via efficient preference Optimization Paper • 2410.18533 • Published 29 days ago • 42
MedMobile: A mobile-sized language model with expert-level clinical capabilities Paper • 2410.09019 • Published Oct 11 • 8
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model Paper • 2410.13639 • Published Oct 17 • 16
BenTo: Benchmark Task Reduction with In-Context Transferability Paper • 2410.13804 • Published Oct 17 • 20
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts Paper • 2410.10626 • Published Oct 14 • 37
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning Paper • 2410.09754 • Published Oct 13 • 7
Thinking LLMs: General Instruction Following with Thought Generation Paper • 2410.10630 • Published Oct 14 • 16
ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains Paper • 2410.09870 • Published Oct 13 • 7
Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition Paper • 2410.05603 • Published Oct 8 • 11
Self-Boosting Large Language Models with Synthetic Preference Data Paper • 2410.06961 • Published Oct 9 • 15
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation Paper • 2410.07170 • Published Oct 9 • 15