Closed fknorr closed 1 year ago
Partially addressed now by the runner restarting GitHub API calls and config.sh
/ run.sh
invocations. The server process should still be informed about irrecoverable errors, and scancel
of a job should issue a restart.
Related to #6 : If a SLURM job dies, e.g. due to job pickup timeout, we should restart it if the corresponding job is still queued in Github.