EleutherAI / elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.
MIT License
186 stars 33 forks source link

Support exponential moving averages for the covariance statistics on EigenReporter #169

Open norabelrose opened 1 year ago

norabelrose commented 1 year ago

Currently EigenReporter supports streaming updates to its covariance statistics via the update method, but it does an unweighted average. For the training use-case, we'll want to use an exponential moving average since the representations of the model will change over time.