hpc / pavilion2

Pavilion is a Python 3 (3.5+) based framework for running and analyzing tests targeting HPC systems.
https://pavilion2.readthedocs.io/
Other
43 stars 20 forks source link

Check for un-runnable jobs in series #741

Open Paul-Ferrell opened 6 months ago

Paul-Ferrell commented 6 months ago

Series should periodically check node status in for tests with prescribed nodes, and cancel tests if node are no longer up.

Paul-Ferrell commented 6 months ago

Scheduler plugins (advanced) should have a function that checks a group of tests to see if all requested nodes are still UP.