Closed fjetter closed 1 year ago
I've seen this sometimes but have not come up with a consistent reproducer. I have suspicions it's related to asyncio.create_subprocess_shell
If there are any debug logs or smth like this I can enable to help us debug this further, let me know. I run into this fairly frequently
@fjetter yeah just enable info
logging for the coiled
logger or pass configure_logging=True
to Cluster
. Might help a little
Switched to a threadpool for this and haven't see anything locally since. I'm assuming we're good.
Fairly frequently I'm running into a weird deadlock during cluster creation when using package_sync.
I'm typically building distributed from a local folder and see the following output
afterwards nothings happens.
If successful, I see the following lines afterwards
To resolve I need to retry, sometimes a couple of times. The issue does not show up in our internal monitoring (I don't see anything in live-errors)