EleutherAI / elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.
MIT License
175 stars 32 forks source link

Add log softmax to model_logits #291

Closed norabelrose closed 10 months ago

norabelrose commented 10 months ago

Ensures that the language model probabilities are normalized across pseudo-labels