kubeflow / common

Common APIs and libraries shared by other Kubeflow operator repositories.
Apache License 2.0
51 stars 73 forks source link

set podgroup failed when pytorchjob failed #179

Closed qiankunli closed 2 years ago

qiankunli commented 2 years ago

I find pytorchjob is failed but relative podgroup is "Inqueue", so in volcano view, podgroup also own the resource. it should set podgroup failed when pytorchjob failed