openai / automated-interpretability

Is there a demo that shows this great project? #35

Closed: guotong1988 closed this issue 11 months ago

guotong1988 commented 11 months ago

Thank you very much!

hijohnnylin commented 11 months ago

[disclaimer: i am the creator of neuronpedia]

check out neuronpedia.org. it uses automated-interpretability to score explanations of gpt2-small neurons, and it lets anyone contribute their own explanations too.