interTwin-eu / vk-test-set

A set of Kubernetes manifests wrapped in pytest to validate interlink sidecars
1 stars 0 forks source link

Add a test to ensure failed jobs result into failed pods #2

Open landerlini opened 1 month ago

landerlini commented 1 month ago

Playing with slurm plugin we realized it returns the status completed even when the job fails. This is indeed the expected behavior for slurm semantics, but not for Kubernetes and the resulting pod would behave differently if submitted to different plugins.

An additional test is needed to make sure than when the job failes, the error status is propagated to k8s.

dciangot commented 1 month ago

@landerlini @Bianco95 can you open an issue to the SLURM plugin repo?

landerlini commented 1 month ago

@mbarbetti is ramping up on that repo and he's finding a few other issues that break compatibility with the slurm setup he is using. He can open the issue and I also expect a PR from him at some point.