EleutherAI / elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.
MIT License
175 stars 32 forks source link

Linear Discriminant Analysis MVP #268

Open norabelrose opened 1 year ago

norabelrose commented 1 year ago

Adds LdaFitter for supervised LDA reporters