Open grondo opened 4 months ago
Note that in this particular case, we had to kill off `flux module remove sched-fluxion-qmanager`,
which was hanging due to the leaked alloc requests issue (can't find the issue right now, feel free to link it here if found).
While reloading fluxion on elcap, several pending jobs were canceled with a fatal job exception such as:
For reference, here are the logs at the time of module reload: