Open lvankampenhout opened 6 years ago
@lvankampenhout - is there a particular postprocessing task where the job doesn't abort correctly?
Hi Alice, I encountered this issue with both the lnd_averages
and timeseries
tasks.
Strangely enough, my jobs do abort today.
Problem: whenever an error occurs somewhere down in the Python code, the batch job hangs and does not abort. When I login onto the compute note I see that there is 100% CPU usage. Not sure if this is a feature of my local cluster (I ported the scripts to SLURM cluster Cartesius) or the postprocessing scripts themselves. However it is clearly sub-optimal because the jobs need to be manually aborted.