jbloomAus / SAELens

Training Sparse Autoencoders on Language Models
https://jbloomaus.github.io/SAELens/
MIT License

[Proposal] tidy normalisation code #151

Open tomMcGrath opened 1 month ago

tomMcGrath commented 1 month ago

Proposal

PR #150 introduces some untidy code in order to get evaluation of activation normalisation working quickly. This code should be tidied up, most likely by moving the activation normalisation calculations either into the SAE or fully back into the activation store.
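To make the "fully back into the activation store" option concrete, here is a minimal sketch of a store-like object that owns both the scale estimate and the normalise/unnormalise steps, so callers never touch the scaling factor directly. Everything in it is an assumption for illustration: the class `NormalisingStore`, its method names, and the choice of rescaling so that the mean L2 norm equals sqrt(d_in) are hypothetical and are not taken from PR #150 or the SAELens API.

```python
import math
from typing import List

Vector = List[float]


class NormalisingStore:
    """Hypothetical store that keeps all normalisation logic in one place."""

    def __init__(self, d_in: int) -> None:
        self.d_in = d_in
        self.scale = 1.0  # identity scaling until estimate_scale() is called

    def estimate_scale(self, sample: List[Vector]) -> float:
        # Assumed convention: pick the scale so that the mean L2 norm of the
        # sample becomes sqrt(d_in).
        if not sample:
            raise ValueError("need at least one activation vector")
        mean_norm = sum(math.sqrt(sum(x * x for x in v)) for v in sample) / len(sample)
        if mean_norm == 0.0:
            raise ValueError("cannot normalise all-zero activations")
        self.scale = math.sqrt(self.d_in) / mean_norm
        return self.scale

    def normalise(self, batch: List[Vector]) -> List[Vector]:
        # Applied before activations are handed to the SAE.
        return [[x * self.scale for x in v] for v in batch]

    def unnormalise(self, batch: List[Vector]) -> List[Vector]:
        # Applied to SAE outputs before they are compared with, or spliced
        # back into, the original model's activations (e.g. during evaluation).
        return [[x / self.scale for x in v] for v in batch]


store = NormalisingStore(d_in=2)
store.estimate_scale([[3.0, 4.0], [6.0, 8.0]])
normalised = store.normalise([[3.0, 4.0], [6.0, 8.0]])
restored = store.unnormalise(normalised)
```

With this shape, evaluation code would only call `normalise` and `unnormalise`, which is one way to avoid scattering scaling arithmetic across the training and evaluation paths. Placing the same two methods on the SAE instead would be the other option named above.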

Context

I (@tomMcGrath) intend to do this in the week beginning 20th May. If this issue is still open on the 24th, please bug me about it.

Checklist