Open wihobbs opened 1 month ago
Actually...this may just be a misnamed warning message. For all exceptions, even ones that don't cancel, the warning that posts says:
{ "exception",
"canceling due to exception",
0,
},
I never actually released the jobs above that were submitted with --urgency=0 but if I release one, the flux-job started
message appears. I'll need to go back to the prolog (my original use case for this) and see if it behaves as expected there, but we may just want to think about renaming the message...
I think the ideal case would be to have a separate message for fatal vs. non-fatal exceptions, but that might be tricky given the way the "switch" case works in src/cmd/job/attach.c
It's friday and clearly my brain is turning to mush. @grondo suggested earlier today that for non-fatal exceptions, continue to print the exception but not the "canceling" message, and proceed to "waiting for prolog." That's cleaner.
On Slack @grondo seemed to think this might be a bug in flux-job attach's status line:
Posting an exception with severity>0 should notify the user, but not cancel the job or hang.