Open Nintorac opened 4 years ago
Can you explain more on your requirements?
Will Pytorch jobs be scheduled to Preemptible GPU:s by default if they are present?
The same question. How will PJ pods behave on instances which can stop/resume/reshedule workloads?
The same question. How will PJ pods behave on instances which can stop/resume/reshedule workloads?
It depends on the training code and restartPolicy you defined in the PyTorchJob yaml. We do not take it as a special case, I think.
Just wondering how this operator handles being run on preemptible GCP instances and where I can find more documentation on the subject
Thanks