I wonder how to load checkpoint and continue training. Because of the OOM (maybe at 30 epochs), I have to load checkpoint and continue training after OOM. I try to add ckpt_path = '....ckpt' (path from lightning_logs/checkpoint), but it seems useless. How should I solve this problem? Thanks a lot!
I wonder how to load checkpoint and continue training. Because of the OOM (maybe at 30 epochs), I have to load checkpoint and continue training after OOM. I try to add ckpt_path = '....ckpt' (path from lightning_logs/checkpoint), but it seems useless. How should I solve this problem? Thanks a lot!