clovaai / stargan-v2

StarGAN v2 - Official PyTorch Implementation (CVPR 2020)

Can I resume from a certain checkpoint? #98

Open jennkimm opened 3 years ago

jennkimm commented 3 years ago

I've been training this model, but it stopped at the 30,000th iteration because the device ran out of memory.

Since we have a limited budget, we'd like to know whether we can resume training from our 20,000-iteration checkpoint.

eric-yim commented 3 years ago

Use the --resume_iter argument, which is checked in solver.py:

    if args.resume_iter > 0:
        self._load_checkpoint(args.resume_iter)

danieltanhx commented 1 year ago

You also need to set the initial value of --lambda_ds when resuming, because of lines 95 and 143 of core/solver.py:

    # remember the initial value of ds weight
    initial_lambda_ds = args.lambda_ds
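
To pick a value for --lambda_ds when resuming, a rough sketch like the one below may help. It assumes the weight is decayed linearly, by initial_lambda_ds / ds_iter on every iteration, and that the run started with lambda_ds = 1 and ds_iter = 100000. Those numbers, the clamp at zero, and the helper name lambda_ds_at are assumptions for illustration, so check them against your own arguments and your copy of core/solver.py.

```python
def lambda_ds_at(iteration, initial_lambda_ds=1.0, ds_iter=100000):
    """Estimate the ds weight after `iteration` training steps.

    Assumes a linear decay of initial_lambda_ds / ds_iter per step.
    The default values are assumptions; substitute the ones you trained with.
    """
    step = initial_lambda_ds / ds_iter
    # Clamp at zero: in this sketch the weight never goes negative.
    return max(initial_lambda_ds - step * iteration, 0.0)


if __name__ == "__main__":
    # Weight to pass as --lambda_ds when resuming from the 20000-iteration checkpoint.
    print(lambda_ds_at(20000))
```

Under these assumptions the 20000-iteration checkpoint corresponds to a weight of about 0.8. Keep in mind that whatever you pass as --lambda_ds is also stored as initial_lambda_ds, so if the decay really is initial_lambda_ds / ds_iter per step, the resumed run will decay in smaller steps than the original one unless you adjust the decay length as well.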