ai-safety-foundation / sparse_autoencoder

Sparse Autoencoder for Mechanistic Interpretability
https://ai-safety-foundation.github.io/sparse_autoencoder/
MIT License
185 stars, 39 forks

Add component dimension support to the metrics #161

Closed by alan-cooney 10 months ago