Parsl / parsl

Parsl - a Python parallel scripting library
http://parsl-project.org
Apache License 2.0
486 stars 194 forks source link

Non-deterministic hang in CI local tests due to combination of several existing issues. #3019

Open benclifford opened 8 months ago

benclifford commented 8 months ago

Describe the bug I noticed this hang in Github Actions, in the regularly hanging --config local tests. I think it is a combination of several existing issues, combined in a way that is interesting enough to document here.

This sequence of events happens in this test run: https://github.com/Parsl/parsl/actions/runs/7460656838/artifacts/1156696809

Here is the event sequence I have observed in those logs.

To Reproduce Run a parsl task with only a manually launched worker, with the correct task port but incorrect result port.

Expected behavior test completion. no incorrect worker connections. workers should be shut down properly.

Environment github actions on a PR that is hopefully unrelated.

benclifford commented 7 months ago

out of interest, i spot checked a hanging CI run just now and it shows these symptoms.