EleutherAI / elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.
MIT License
178 stars 33 forks source link

LEACE normalization #242

Closed norabelrose closed 1 year ago

norabelrose commented 1 year ago

Apparently the original CCS code did this, no one noticed until yesterday, and it makes a significant difference to performance