A Primer on the Inner Workings of Transformer-based Language Models (Paper • 2405.00208 • Published Apr 30)