Multi-Layer Sparse Autoencoders
Collection
Single SAEs trained on the residual stream activation vectors from every transformer layer simultaneously: https://arxiv.org/abs/2409.04185
•
30 items
•
Updated
This model has been pushed to the Hub using the PytorchModelHubMixin integration: