tim-lawson
/

mlsae-gpt2-x64-k32

model_hub_mixin

pytorch_model_hub_mixin

Model card Files Files and versions Community

tim-lawson commited on Dec 2, 2024

Commit

7b07814

·

verified ·

1 Parent(s): 756c936

Update README.md

Files changed (1) hide show

README.md +20 -5

README.md CHANGED Viewed

@@ -1,9 +1,24 @@
 ---
 tags:
-- model_hub_mixin
-- pytorch_model_hub_mixin
 ---
-This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
-- Library: [More Information Needed]
-- Docs: [More Information Needed]

 ---
+language: en
+library_name: mlsae
+license: mit
 tags:
+  - model_hub_mixin
+  - pytorch_model_hub_mixin
+datasets:
+  - monology/pile-uncopyrighted
 ---
+# mlsae-gpt2-x64-k32
+A Multi-Layer Sparse Autoencoder (MLSAE) trained on the residual stream
+activation vectors from every layer of
+[openai-community/gpt2](https://huggingface.co/openai-community/gpt2)
+with an expansion factor of 64 and k = 32, over 1 billion tokens from
+[monology/pile-uncopyrighted](https://huggingface.co/datasets/monology/pile-uncopyrighted).
+For more details, see:
+- Paper: <https://arxiv.org/abs/2409.04185>
+- GitHub repository: <https://github.com/tim-lawson/mlsae>
+- Weights & Biases project: <https://wandb.ai/timlawson-/mlsae>