Closed robinkar closed 1 year ago
We are currently setting up OOD in a cluster where Slurm seems to use the state OOM/OUT_OF_MEMORY for jobs that were terminated due to exceeding the allocated memory. This PR makes OOD handle that state too.
Example:
Thanks. I'm on vacation at the moment, but this seems just fine. I'll circle back later this week.
We are currently setting up OOD in a cluster where Slurm seems to use the state OOM/OUT_OF_MEMORY for jobs that were terminated due to exceeding the allocated memory. This PR makes OOD handle that state too.
Example: