AlignmentResearch / tuned-lens

Tools for understanding how transformer predictions are built layer-by-layer
https://tuned-lens.readthedocs.io/en/latest/
MIT License
432 stars, 47 forks

Fix silent failure when training runs out of data #113

Closed levmckinney closed 7 months ago

levmckinney commented 1 year ago

Currently, the tuned lens trainer neither errors nor warns when it runs out of training data before reaching the requested number of steps. This can be seen in our training run on Anthropic/hh-rlhf.

Ideally, this should raise an error before training begins, with a message such as "insufficient data for steps requested". At the very least, it should emit a warning at the end of training.
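
A minimal sketch of the kind of check described above, assuming the trainer knows the total number of training tokens and how many tokens each step consumes. The function name `check_enough_data` and its parameters are hypothetical and are not part of the tuned-lens API.

```python
import warnings


def check_enough_data(
    num_tokens: int,
    tokens_per_step: int,
    num_steps: int,
    strict: bool = True,
) -> int:
    """Return how many of the requested steps the dataset can support.

    Hypothetical helper, not taken from tuned-lens. With strict=True a
    shortfall raises before training starts; with strict=False it only
    emits a warning and returns the reduced step count.
    """
    if tokens_per_step <= 0:
        raise ValueError("tokens_per_step must be positive")

    # Number of full steps the available data can feed.
    available_steps = num_tokens // tokens_per_step

    if available_steps < num_steps:
        message = (
            "insufficient data for steps requested: "
            f"{num_steps} steps requested, but the data supports only "
            f"{available_steps}"
        )
        if strict:
            raise ValueError(message)
        warnings.warn(message, RuntimeWarning)

    return min(available_steps, num_steps)
```

Calling this once before the training loop would turn the silent truncation into an explicit failure, while `strict=False` covers the fallback of only warning.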