Towards Best Practices of Activation Patching in Language Models: Metrics and Methods Paper • 2309.16042 • Published Sep 27, 2023 • 3