jbloomAus / SAELens

Training Sparse Autoencoders on Language Models
https://jbloomaus.github.io/SAELens/
MIT License
193 stars, 67 forks

feat: new saes for gemma-2b-it and feature splitting on gpt2-small-la… #195

Closed by jbloomAus 6 days ago

jbloomAus commented 6 days ago

Description

New SAEs:

The GPT2 small SAEs are already on Neuronpedia.

The Gemma-2b IT SAE will be on Neuronpedia soon.