EleutherAI / elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.
MIT License
178 stars, 33 forks

CCS Platt scale #244

Closed: lauritowal closed this 1 year ago

lauritowal commented 1 year ago

Adds Platt scaling to CCS. Depends on https://github.com/EleutherAI/elk/pull/242.
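
For context, Platt scaling calibrates raw classifier scores by fitting a sigmoid `p = sigmoid(a * s + b)` to labeled examples. The sketch below is a generic, dependency-free illustration of that idea, not the code from this PR; the names `fit_platt` and `calibrate`, the learning rate, the step count, and the toy data are all assumptions made for the example.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_platt(scores, labels, lr=0.1, steps=2000):
    """Fit p(y=1 | s) = sigmoid(a * s + b) by gradient descent on mean log loss.

    Hypothetical helper for illustration only; `scores` are raw (uncalibrated)
    classifier outputs and `labels` are 0/1 ground-truth values.
    """
    a, b = 1.0, 0.0
    n = len(scores)
    for _ in range(steps):
        grad_a = 0.0
        grad_b = 0.0
        for s, y in zip(scores, labels):
            err = sigmoid(a * s + b) - y  # d(log loss)/d(logit)
            grad_a += err * s
            grad_b += err
        a -= lr * grad_a / n
        b -= lr * grad_b / n
    return a, b


def calibrate(score, a, b):
    # Map a raw score to a calibrated probability with the fitted scale and bias.
    return sigmoid(a * score + b)


# Toy, non-separable data (assumed for the example).
scores = [-2.0, -1.0, -0.5, 0.5, 1.0, 2.0]
labels = [0, 0, 1, 0, 1, 1]
a, b = fit_platt(scores, labels)
probs = [calibrate(s, a, b) for s in scores]
```

Only two scalars are learned, so the ranking of examples by score is preserved whenever `a` is positive; only the confidence attached to each score changes.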

CLAassistant commented 1 year ago

All committers have signed the CLA.