Debugging support - Githubissues

Today I had a segfault in a step, dependent on the actual arguments of the steps. There is currently simply no way to debug this. At the very least, I need a way to inspect the arguments of any step that has failed. This problem had occurred before when I had only some instances of a step getting oom'd.

By contrast, even nextflow can handle that because you can navigate to a task's folder, inspect the linked files, and re-run the step in isolation ; and we use that feature a lot. Here we're comparatively stuck.

As a lead to go forward, one of the main pain is that we can't access the worker since it's in the background ; but if we raised an exception with the whole task information in the client on error, it could be possible to drop to a debugger and call galp.run or whatever in the prompt to inspect args.

Also, while the short names are good in logs, on errors we absolutely need the full names so that we can use them to write debugging code.

We could consider dumping the arguments on error ; too. The main issue is that it could be huge ; but we can have guards, we can dump to a standalone file or whatever.

In summary we lack, by order of priority:

A way to inspect the arguments of the task that failed.
A way to selectively re-run a task that failed in isolation.
Integration with the debugger on the client.

emorice / galp

Debugging support #90