Open grondo opened 2 weeks ago
@trws - is there any way the exception wrapper fluxion puts around mod_main()
could be causing it to be called repeatedly or recursively? That would be
https://github.com/flux-framework/flux-sched/blob/master/src/common/c%2B%2Bwrappers/eh_wrapper.hpp
which is way over my head.
It certainly shouldn't be. The passed function object is called only once. If there's a way the call from the event loop can be retried, for example if something isn't reset, then it's possible an exception is causing it to exit before hitting that point and the event loop is calling it again. But it would have to be something like that.
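To make the "called only once" point concrete, here is a minimal sketch of what an exception wrapper of this kind typically looks like. This is not the actual code from eh_wrapper.hpp: the names `call_once_noexcept`, `failing_main`, and `calls`, and the choice of errno values, are all assumptions for illustration. The point is only that the wrapper invokes the callable a single time and converts any exception into an error return, so a repeated call would have to come from whoever calls the wrapper.

```cpp
#include <cerrno>
#include <new>
#include <stdexcept>
#include <utility>

// Hypothetical wrapper (NOT the real eh_wrapper.hpp implementation).
// Invokes f exactly once. If f throws, the exception is swallowed,
// errno is set, and -1 is returned. There is no loop or retry here.
template <typename Fn, typename... Args>
int call_once_noexcept (Fn &&f, Args &&...args) noexcept
{
    try {
        return std::forward<Fn> (f) (std::forward<Args> (args)...);
    } catch (const std::bad_alloc &) {
        errno = ENOMEM;   // assumed mapping, for illustration only
    } catch (...) {
        errno = EINVAL;   // assumed mapping, for illustration only
    }
    return -1;
}

// Hypothetical stand-in for mod_main(): counts invocations, then throws.
static int calls = 0;

int failing_main (int)
{
    ++calls;
    throw std::runtime_error ("simulated failure");
}
```

With a wrapper shaped like this, calling `call_once_noexcept (failing_main, 0)` increments `calls` once and returns -1. If the wrapped function were observed running multiple times, the repetition would have to originate in the caller (for example, an event loop re-invoking the entry point after the early exit).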
In the logs captured below, the scheduler was apparently loaded at 10:58, then we see the messages:
Followed by what appears to be a `hello` storm, with at least 35 `hello` requests sent to the job manager if the logs can be believed. Full logs here: