Have `feature_web` use the cached neuron labels

DavidUdell / sparse_circuit_discovery

Circuit discovery in GPT-2 small, using sparse autoencoding

MIT License

6 stars 1 forks source link

Have `feature_web` use the cached neuron labels #7

Closed DavidUdell closed 9 months ago

DavidUdell commented 9 months ago

With the exception of the toy RASP models, all models run against the current interpretability pipeline generate a set of top-k input token labels per neuron. feature_web should label its output .png neurons with these labels, except for when the model is the RASP model.