Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published 14 days ago • 30
hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4 Text Generation • Updated Sep 13 • 123k • 37
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8 • 155