epam / cloud-pipeline

Cloud agnostic genomics analysis, scientific computation and storage platform
https://cloud-pipeline.com
Apache License 2.0
145 stars 59 forks source link

Force deleted hanging pods #3478

Open mzueva opened 6 months ago

mzueva commented 6 months ago

Background After these changes 43266d771bc302039e05761732d5519450a9d800 we removed force delete for pods which sometimes lead to pod hanging in terminated status for unlimited period.

Approach PodMonitor shall contain in-memory list of pods which failed to be deleted and if pod is not deleted in N attempts (set as SystemPreference) pod shall be force deleted.