Open ColemanTom opened 3 months ago
This message is a pretty reasonable explanation of the situation:
Cannot determine whether workflow is running on a-different-host.
What's happening here is that a Cylc client attempted to contact the scheduler, but failed. This can be caused by:
As a safety check, the client will check that the scheduler process is still running by performing a process listing on the box where the scheduler was running. If it finds the same PID with the same CMD, it knows that the scheduler is running, but is not contactable for some reason. If it does not find the process, then it knows that the scheduler has crashed and removes the contact
file so that other Cylc interfaces can see that the workflow is not running (preventing other clients attempting to contact it).
Under normal circumstances the check should not fail. If it does, it's likely a setup / installation / network problem.
I'm not sure this is a bug, so I've not labelled it as one. I'm wondering if the error message can be improved at the end of the below output.
What happened was
cylc psutil
whilst I was reloading a workflowcylc
(don't ask)I wonder if even saying something about that command being to start the workflow rather than it just outputting a fairly random command would be useful. Also, not outputting it twice would be nice.