Closed Nailo closed 2 years ago
Because I run EquiBind on a server with GPU, but it has time limitation. So I must use checkpoint for my training. How can I continue my training by using the checkpoint?
Thank you
You can do that using the checkpoint argument in the .yml file!
Thank you, Hannes. I found that I was getting the error because I added the parameter #PBS_NGPUS.
Because I run EquiBind on a server with GPU, but it has time limitation. So I must use checkpoint for my training. How can I continue my training by using the checkpoint?
Thank you