OpenMOSS / Language-Model-SAEs

For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.
21 stars 3 forks source link

Resolves #10 #19

Closed dest1n1s closed 3 weeks ago

dest1n1s commented 3 weeks ago

This PR includes: