tim-lawson
/

mlsae-pythia-70m-deduped-x2-k32

model_hub_mixin

pytorch_model_hub_mixin

Model card Files Files and versions Community

Edit model card

mlsae-pythia-70m-deduped-x2-k32

A Multi-Layer Sparse Autoencoder (MLSAE) trained on the residual stream activation vectors from every layer of EleutherAI/pythia-70m-deduped with an expansion factor of 2 and k = 32, over 1 billion tokens from monology/pile-uncopyrighted.

For more details, see:

Paper: https://arxiv.org/abs/2409.04185
GitHub repository: https://github.com/tim-lawson/mlsae
Weights & Biases project: https://wandb.ai/timlawson-/mlsae

Downloads last month: 7

Safetensors

Model size

1.05M params

Tensor type

F32

·

Inference API

Unable to determine this model’s pipeline type. Check the docs .

Dataset used to train tim-lawson/mlsae-pythia-70m-deduped-x2-k32

Collection including tim-lawson/mlsae-pythia-70m-deduped-x2-k32

Multi-Layer Sparse Autoencoders

Single SAEs trained on the residual stream activation vectors from every transformer layer simultaneously: https://arxiv.org/abs/2409.04185 • 30 items • Updated Oct 7