Closed kanwaljitkhurmi closed 5 months ago
Has anyone else faced this issue?
/cc @jagadeeshi2i Would you like to help with this issue? Thank you!
@kanwaljitkhurmi Did the launcher start the worker and master pods? Can you share the logs or the output of describing the pod?
I was able to launch a PyTorch job using this example: https://github.com/kubeflow/pipelines/blob/master/samples/contrib/pytorch-samples/Pipeline-Bert-Dist.ipynb
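For anyone gathering that information, here is a minimal sketch of the `kubectl` calls that usually answer those questions, wrapped in Python so they can be run from a notebook. The job name, pod name, and namespace below are hypothetical placeholders, not values from this issue, and the sketch assumes `kubectl` is installed and pointed at the cluster where the PyTorchJob CRD is available.

```python
import shlex
import subprocess
from typing import List


def job_commands(job_name: str, namespace: str) -> List[List[str]]:
    """Build kubectl commands that show whether the PyTorchJob and its pods exist."""
    return [
        # Was a PyTorchJob resource created in this namespace at all?
        ["kubectl", "get", "pytorchjobs", "-n", namespace],
        # Status conditions and events recorded on the job itself.
        ["kubectl", "describe", "pytorchjob", job_name, "-n", namespace],
        # Pods in the namespace; master and worker pods should appear here.
        ["kubectl", "get", "pods", "-n", namespace],
    ]


def pod_commands(pod_name: str, namespace: str) -> List[List[str]]:
    """Build kubectl commands that describe a single pod and fetch its logs."""
    return [
        ["kubectl", "describe", "pod", pod_name, "-n", namespace],
        ["kubectl", "logs", pod_name, "-n", namespace],
    ]


def run_all(commands: List[List[str]]) -> None:
    """Run each command and print its output; failures are printed, not raised."""
    for cmd in commands:
        print("$ " + shlex.join(cmd))
        result = subprocess.run(cmd, capture_output=True, text=True, check=False)
        print(result.stdout or result.stderr)


if __name__ == "__main__":
    # Hypothetical names: replace with your own job, pod, and namespace.
    run_all(job_commands("my-pytorch-job", "my-namespace"))
    run_all(pod_commands("my-launcher-pod", "my-namespace"))
```

If the first command lists no PyTorchJob, the launcher never created the resource, and the launcher pod's own description and logs are the place to look.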
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Closing this issue. No activity for more than a year.
close
/close
@rimolive: Closing this issue.
Environment
How do you deploy Kubeflow Pipelines (KFP)? As part of the Kubeflow Manifest 1.4
KFP version:
KFP SDK version: 1.8.4
Steps to reproduce
I am trying to use the Kubeflow PyTorchJob launcher component in a Kubeflow pipeline. However, the pipeline component waits endlessly in the main thread with the following logs and never proceeds to create the master and worker pods.
Code:
Can you help?
Expected result
Materials and reference
Impacted by this bug? Give it a 👍. We prioritise the issues with the most 👍.