tim-lawson/mlsae-pythia-160m-deduped-x256-k32

Tags: model_hub_mixin, pytorch_model_hub_mixin


This model has been pushed to the Hub using the PyTorchModelHubMixin integration:

Library: https://github.com/tim-lawson/mlsae
Docs: [More Information Needed]

Downloads last month: 6

Safetensors
Model size: 302M params
Tensor type: F32
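The 302M figure is consistent with a back-of-the-envelope count. The sketch below rests on assumptions not stated on this page: that Pythia-160m has a residual stream width of 768, that `x256` in the model name is the expansion factor (number of latents divided by model width), and that the autoencoder consists of one encoder matrix and one decoder matrix of equal size. The helper `sae_param_count` is hypothetical and is not part of the mlsae library.

```python
# Assumptions (not stated on the model card):
#   - d_model = 768 for Pythia-160m
#   - "x256" means n_latents = 256 * d_model
#   - parameters = encoder matrix + decoder matrix (+ optional bias vectors)
D_MODEL = 768
EXPANSION_FACTOR = 256
BYTES_PER_F32 = 4


def sae_param_count(d_model: int, expansion_factor: int, with_biases: bool = False) -> int:
    """Count parameters of a one-hidden-layer autoencoder (hypothetical helper)."""
    n_latents = d_model * expansion_factor
    weights = 2 * d_model * n_latents  # encoder and decoder matrices
    biases = (n_latents + d_model) if with_biases else 0
    return weights + biases


n_params = sae_param_count(D_MODEL, EXPANSION_FACTOR)
size_gb = n_params * BYTES_PER_F32 / 1e9
print(f"{n_params:,} parameters, about {size_gb:.2f} GB in F32")
```

Under these assumptions the count is 301,989,888, which rounds to the 302M shown above. Adding bias vectors changes the total by less than 0.1%, so this estimate cannot tell whether biases are present.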


Collection including tim-lawson/mlsae-pythia-160m-deduped-x256-k32

Multi-Layer Sparse Autoencoders

Single sparse autoencoders (SAEs), each trained on the residual stream activation vectors from every transformer layer simultaneously: https://arxiv.org/abs/2409.04185
30 items • Updated 28 days ago
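As a rough illustration of the idea in the collection description, the toy sketch below applies one shared autoencoder to a residual stream vector from each layer. It is not the mlsae library's implementation. The class `ToyMultiLayerSAE`, the top-k sparsity rule (an assumption based on the `k32` suffix in the model name), the transposed-encoder decoder initialisation, and the tiny dimensions are all illustrative choices.

```python
import random


def top_k(values, k):
    """Keep the k largest entries of `values` and set the rest to zero."""
    keep = set(sorted(range(len(values)), key=lambda i: values[i], reverse=True)[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(values)]


def matvec(matrix, vector):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class ToyMultiLayerSAE:
    """Hypothetical toy model: one encoder/decoder pair shared by all layers."""

    def __init__(self, d_model, n_latents, k, seed=0):
        rng = random.Random(seed)
        self.k = k
        self.encoder = [[rng.gauss(0.0, 1.0) for _ in range(d_model)] for _ in range(n_latents)]
        # Decoder starts as the transpose of the encoder (an illustrative choice).
        self.decoder = [[self.encoder[j][i] for j in range(n_latents)] for i in range(d_model)]

    def encode(self, x):
        return top_k(matvec(self.encoder, x), self.k)

    def decode(self, z):
        return matvec(self.decoder, z)


# Toy sizes only; the real model is far larger.
D_MODEL, N_LATENTS, K, N_LAYERS = 4, 16, 2, 3
sae = ToyMultiLayerSAE(D_MODEL, N_LATENTS, K)

rng = random.Random(1)
# Stand-in residual stream vectors for one token, one per layer.
layer_activations = [[rng.gauss(0.0, 1.0) for _ in range(D_MODEL)] for _ in range(N_LAYERS)]

# The same autoencoder instance processes every layer's activation vector.
latents = [sae.encode(x) for x in layer_activations]
reconstructions = [sae.decode(z) for z in latents]
```

Because every layer shares the same latent space, the latent vectors in `latents` can be compared index by index across layers, which a separate autoencoder per layer would not allow directly.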