Closed siaahmadi closed 1 year ago
I got a similar issue but the ADASYN and KMeansSMOTE
OpenBLAS warning: precompiled NUM_THREADS exceeded, adding auxiliary array for thread metadata. OpenBLAS warning: precompiled NUM_THREADS exceeded, adding auxiliary array for thread metadata. [1] 3072415 segmentation fault (core dumped)
I've been getting this and other C-level issues with nearly every sampler I've tried in imblearn 0.10.1 (recently raised another issue about them here). It's been pretty disappointing tbh.
For me, I've noticed the errors tend to arise with large data sizes. I was able to produce a segfault with SMOTE earlier today (with about a size of (20 million, 100)), but your example is working fine for me.
This looks like some low-level openblas issue. It could be linked to the internal NearestNeighbors
since we don't do such low-level code. I would advise you to report upstream.
One potential issue that I got when releasing if scikit-learn is installed from the defaults channel, then it is built with LLVM/CLANG OMP that is incompatible with the MKL OMP that could be used in the install.
We reported the bug upstream: https://github.com/ContinuumIO/anaconda-issues/issues/13221
Providing the output of
python -m threadpoolctl -i sklearn
would allow checking if there is a mix of libomp and libiomp.
Running the following code leads to a segfault (Python 3.9.2):
Version info: