FNNDSC / pman

A process management system written in python
MIT License

Kubernetes OOM job restarts #207

Open jennydaman opened 2 years ago

jennydaman commented 2 years ago

When a job exceeds its memory limit, Kubernetes restarts it, which is pointless because it will just run out of memory again. Moreover, the outputdir persists across the restart, so the restarted job does not start from a clean state.
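
One possible way to avoid the restart is sketched below. It assumes the work is submitted as a `batch/v1` Job and builds the manifest as a plain dict. The helper name `build_job_manifest` and its parameters are hypothetical and are not taken from pman's code. The idea is to set `restartPolicy: Never` so the kubelet does not restart the OOM-killed container in place, and `backoffLimit: 0` so the Job controller does not create a replacement pod.

```python
import json


def build_job_manifest(name, image, command, memory_limit):
    """Return a batch/v1 Job manifest (plain dict) that is not retried on failure.

    Hypothetical helper for illustration only; not pman's actual implementation.
    """
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {"name": name},
        "spec": {
            # 0 retries: a failed pod (e.g. OOMKilled) marks the Job as failed
            # instead of being replaced by a new pod.
            "backoffLimit": 0,
            "template": {
                "spec": {
                    # Never: the kubelet does not restart the container in place.
                    "restartPolicy": "Never",
                    "containers": [
                        {
                            "name": name,
                            "image": image,
                            "command": command,
                            "resources": {"limits": {"memory": memory_limit}},
                        }
                    ],
                }
            },
        },
    }


if __name__ == "__main__":
    # Example values are placeholders.
    manifest = build_job_manifest(
        name="example-job",
        image="example/image:latest",
        command=["python", "main.py"],
        memory_limit="512Mi",
    )
    print(json.dumps(manifest, indent=2))
```

With these two settings the Job should fail once and stay failed, so nothing reruns against an outputdir that already contains partial output. Cleaning up that partial output would still need to be handled separately.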