This repository contains the weights of a sparse autoencoder (SAE) I trained on the residual stream activations of Mistral-7B-Instruct-v0.1. I used The Pile (uncopyrighted) as the training data. So far I have trained only a single SAE, on layer 16, though I may train more in the future.
The easiest way to use the model is with the SAE Lens library.
Here is the training repo.
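For readers unfamiliar with what an SAE computes, here is a minimal NumPy sketch of the encode/decode step on a batch of residual activations. It is an illustration under assumptions, not the code used to train this checkpoint: the parameter names (`W_enc`, `b_enc`, `W_dec`, `b_dec`), the ReLU encoder with a subtracted decoder bias, the random initialization, and the toy sizes (16-dimensional activations, 64 features) are all hypothetical. The real model operates on Mistral-7B's 4096-dimensional residual stream with trained weights.

```python
import numpy as np


def init_sae(d_model, d_sae, seed=0):
    """Create randomly initialized SAE parameters (hypothetical layout)."""
    rng = np.random.default_rng(seed)
    W_enc = rng.normal(0.0, 0.02, size=(d_model, d_sae))
    return {
        "W_enc": W_enc,              # encoder weights, (d_model, d_sae)
        "b_enc": np.zeros(d_sae),    # encoder bias
        "W_dec": W_enc.T.copy(),     # decoder weights, (d_sae, d_model)
        "b_dec": np.zeros(d_model),  # decoder bias
    }


def sae_encode(x, p):
    """Map activations (batch, d_model) to non-negative features (batch, d_sae)."""
    return np.maximum((x - p["b_dec"]) @ p["W_enc"] + p["b_enc"], 0.0)


def sae_decode(f, p):
    """Reconstruct activations (batch, d_model) from features (batch, d_sae)."""
    return f @ p["W_dec"] + p["b_dec"]


# Toy stand-in for a batch of layer-16 residual activations.
params = init_sae(d_model=16, d_sae=64)
acts = np.random.default_rng(1).normal(size=(5, 16))

features = sae_encode(acts, params)
recon = sae_decode(features, params)
print(features.shape, recon.shape)
```

With trained weights, most entries of `features` would be zero for any given token, and `recon` would approximate `acts`. With the random weights above, only the shapes and the non-negativity of the features are meaningful.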