openai / automated-interpretability
