EleutherAI / elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.
MIT License
178 stars 33 forks source link

Raw extraction #200

Closed AlexTMallen closed 1 year ago

AlexTMallen commented 1 year ago
norabelrose commented 1 year ago

Won't merge as-is; let's talk about ways to accomplish a similar goal within the templates system perhaps