When resuming from a checkpoint, the naming should continue from there.

IntelLabs / coach

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

https://intellabs.github.io/coach/

Apache License 2.0

2.32k stars 459 forks source link

When resuming from a checkpoint, the naming should continue from there. #248

Open redknightlois opened 5 years ago

redknightlois commented 5 years ago

In the training procedure sometimes machines get rebooted, etc. When resume happens from checkpoint automated the next checkpoint is no longer going to continue the numbering from where it is left. Which would be useful to keep track of actual training iterations in case of force majeure like Windows rebooting itself for updates, power losses, etc.