Closed rmshaffer closed 2 years ago
One possibility to investigate: Upgrade Microsoft.Jupyter.Core to consume a recent version of NetMQ and take advantage of new thread-safe socket implementation from https://github.com/zeromq/netmq/pull/871.
One possibility to investigate: Upgrade Microsoft.Jupyter.Core to consume a recent version of NetMQ and take advantage of new thread-safe socket implementation from zeromq/netmq#871.
I wonder if this may also be the root cause of #510?
One possibility to investigate: Upgrade Microsoft.Jupyter.Core to consume a recent version of NetMQ and take advantage of new thread-safe socket implementation from zeromq/netmq#871.
I wonder if this may also be the root cause of #510?
It does indeed sound very similar. I don't know that the proposed upgrade of NetMQ would actually resolve this, but it's my best guess as to what might be going wrong here since, at least from what I observed, the process appeared to be hung in that part of the NetMQ stack.
This should be fixed with #708, which is now released as part of the QDK 0.25.228311+
Describe the bug When running the IQ# kernel in an environment with
pyzmq>=20.0.0
, we see frequent intermittent hangs where the kernel seems to stop responding.It seems there is an unhandled exception being thrown from NetMQ code, which is consumed by Microsoft.Jupyter.Core. For example, in one instance I saw the following unhandled exception stack trace:
To Reproduce Steps to reproduce the behavior:
pyzmq>=20.0.0
.We especially observed this when our CI runs were installing
pyzmq>=20.0.0
.Expected behavior No hangs. All commands should complete.
System information
Additional context This has been mitigated for now in #371 by adding
pyzmq<20.0.0
to the requirements of the IQ# Python and Conda packages, as well as pinning our CI builds topyzmq==19.0.2
. But it would be nice to fix this underlying issue so that we can remove that restriction and consume newer versions ofpyzmq
as needed.