Open brightsparc opened 2 years ago
The backtrace shows a referencing issue on the Python side. Not sure which objects trigger it. Clearly multithreaded, so perhaps reference handling issue in which a temporary ref counts dropped resulting incorrect collection and reuse of a live object. Doesn't have to be JPype code itself, though it is certainly suspect. If there is a clear method to simplify this and replicate it could be posted to the JPype issues, though these are often very difficult to trace down unless one can isolate what is happening at the time so we can isolate to the path.
Thanks for raising the issue! As @Thrameos mentioned, it can be difficult to track down the source of these kinds of errors without a reproducer. @brightsparc Would it be possible to also include the jpype, dask and distributed versions from an earlier environment where these segfaults weren't seen. Also are you using a LocalCluster
for parallelizing the dask computations or is this just with a dask-sql context?
What happened:
Having upgrade to
dask-sql-2022.4.1
I have been seeing a number of seg faults that appear to be related to theJPype
java iterop.What you expected to happen:
I was expecting dask-sql to be stable, but I require this upgraded version to support case insensitive queries with dask.
Minimal Complete Verifiable Example:
I am yet to repro with a concise example, but have included the seg fault log file if this helps.
Anything else we need to know?:
This happens both locally on mac arm, and also in ubuntu linux build runners.
Environment: