Closed goswamig closed 5 months ago
@akartsky @surajkota @mbaijal @ryansteakley FYI.
/area components/aws/sagemaker
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
This issue has been automatically closed because it has not had recent activity. Please comment "/reopen" to reopen it.
What steps did you take
If node scales/up down, the sagemaker component tries to create the same job which fails. Since sagemaker does not let create the same name job. Component controller should be able to detect this and resume the job from existing state.
What happened:
the job hangs/fail
What did you expect to happen:
I expect the job to resume from previous state.
Environment:
kfp-1.6
Impacted by this bug? Give it a 👍. We prioritise the issues with the most 👍.