I'm having the following issue. Let's say I start a job. In config_full.yaml I have save_steps: 1000. At some point I realize that saving every 1000 steps is too frequent, so I stop the job, edit config_full.yaml to set save_steps: 10000, and restart the job. Resuming from the checkpoint goes as planned, but checkpoints are still saved every 1000 steps (the original value). What am I doing wrong?