OSC / ood_core

Open OnDemand core library
https://osc.github.io/ood_core/
MIT License

Add OUT_OF_MEMORY state for Slurm #809

Closed: robinkar closed this 1 year ago

robinkar commented 1 year ago

We are currently setting up OOD on a cluster where Slurm seems to report the state OOM/OUT_OF_MEMORY for jobs that were terminated for exceeding their allocated memory. This PR makes OOD handle that state too.
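
For readers unfamiliar with the idea, here is a minimal sketch of what handling an extra scheduler state can look like. It is an illustration only and is not taken from the ood_core source: the `STATE_MAP` contents, the `status_for` helper, the `:undetermined` fallback, and the choice to treat an out-of-memory job as `:completed` are all assumptions made for this example.

```ruby
# Hypothetical, simplified lookup table from Slurm state strings to
# generic job statuses. The entries and the target symbols are
# assumptions for illustration, not the library's actual table.
STATE_MAP = {
  'PD'            => :queued,     # PENDING
  'R'             => :running,    # RUNNING
  'CD'            => :completed,  # COMPLETED
  'F'             => :completed,  # FAILED
  # Assumption: a job killed for exceeding its memory allocation has
  # finished, so both the compact and the long form map to :completed.
  'OOM'           => :completed,
  'OUT_OF_MEMORY' => :completed
}.freeze

# Hypothetical helper: translate a Slurm state string into a generic
# status, falling back to :undetermined for anything not in the table.
def status_for(slurm_state)
  STATE_MAP.fetch(slurm_state, :undetermined)
end

puts status_for('OOM')
puts status_for('SOMETHING_ELSE')
```

Without an entry for the state, a lookup like this falls through to the fallback value, which is presumably why an unrecognized state would not be reported as a finished job.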

johrstrom commented 1 year ago

Thanks. I'm on vacation at the moment, but this seems just fine. I'll circle back later this week.