Currently, a checkpoint is only stored if we reach the configurated total_steps. However, storing the latest checkpoint at the end of training for cases with fewer actual training steps than specified with total_steps would also be good.
Motivation
To be sure to have at least one checkpoint after a (long) run.
Feature request
Currently, a checkpoint is only stored if we reach the configurated total_steps. However, storing the latest checkpoint at the end of training for cases with fewer actual training steps than specified with total_steps would also be good.
Motivation
To be sure to have at least one checkpoint after a (long) run.