EleutherAI / elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.
MIT License
186 stars 33 forks source link

Add log softmax to model_logits #291

Closed norabelrose closed 1 year ago

norabelrose commented 1 year ago

Ensures that the language model probabilities are normalized across pseudo-labels