stanfordnlp / pyvene

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
http://pyvene.ai
Apache License 2.0
652 stars, 63 forks

[Minor] Adding interchange intervention for SAEs #187

Closed. frankaging closed this 2 months ago

frankaging commented 2 months ago

Description

Tutorial change: adds an interchange intervention for SAEs.
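
For readers unfamiliar with the idea, here is a minimal sketch of an interchange intervention in a sparse autoencoder's latent space, written in plain PyTorch. It is not the tutorial's code and does not use the pyvene API. `ToySAE`, `sae_interchange`, the dimensions, and the chosen latent indices are all hypothetical and exist only for illustration. The idea shown: encode the base and source activations, copy the selected latents from source into base, then decode.

```python
import torch

torch.manual_seed(0)


class ToySAE(torch.nn.Module):
    """Hypothetical toy sparse autoencoder (randomly initialised, untrained)."""

    def __init__(self, d_model: int, d_latent: int):
        super().__init__()
        self.enc = torch.nn.Linear(d_model, d_latent)
        self.dec = torch.nn.Linear(d_latent, d_model)

    def encode(self, x: torch.Tensor) -> torch.Tensor:
        # ReLU keeps latents non-negative, as in a standard SAE encoder.
        return torch.relu(self.enc(x))

    def decode(self, z: torch.Tensor) -> torch.Tensor:
        return self.dec(z)


def sae_interchange(sae, base_act, source_act, latent_idx):
    """Replace the chosen SAE latents of the base run with those of the source run."""
    with torch.no_grad():
        z_base = sae.encode(base_act)
        z_source = sae.encode(source_act)
        z_new = z_base.clone()
        # Interchange: overwrite only the selected latent dimensions.
        z_new[..., latent_idx] = z_source[..., latent_idx]
        # Map the edited latents back to the model's activation space.
        return sae.decode(z_new)


# Stand-ins for activations collected at one layer on two different prompts.
sae = ToySAE(d_model=8, d_latent=32)
base = torch.randn(2, 8)
source = torch.randn(2, 8)
patched = sae_interchange(sae, base, source, latent_idx=[3, 7])
```

In a real setup, `patched` would be written back into the base forward pass at the same layer. Some setups also add the base reconstruction error (`base - sae.decode(sae.encode(base))`) to the output so that only the swapped latents change the activation. This sketch omits that step.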

Testing Done

N.A.

Checklist: