aertslab / pySCENIC

pySCENIC is a lightning-fast python implementation of the SCENIC pipeline (Single-Cell rEgulatory Network Inference and Clustering) which enables biologists to infer transcription factors, gene regulatory networks and cell types from single-cell RNA-seq data.
http://scenic.aertslab.org
GNU General Public License v3.0

[BUG] Warnings when running pyscenic grn #559

Open zhangdong360 opened 5 days ago

zhangdong360 commented 5 days ago

Describe the bug When I run pySCENIC, I often encounter disturbing warnings. I thought my problem might be related to https://github.com/aertslab/pySCENIC/issues/482, but I am not using port 8787, and I rarely see this warning on the HPC where I do have RStudio Server installed, so I don't think it is related to that. I suspect the problem lies with dask, but I am not well versed in it. On top of that, the run produces no output, so I cannot judge whether I need to re-run the program, and as mentioned above, re-running it will most likely hit the warnings again. I have also tried arboreto_with_multiprocessing.py, but it is too inefficient: on a small test sample it was nearly twice as slow as pySCENIC with the same number of CPU cores, which I don't think is acceptable for a large dataset. This has already cost me a lot of effort; I have had to downsample my data to get runs through, but I don't think that is a long-term solution.

(scanpy) [zhangdong_2@jupyterlab-md-npeer5o1 pySCENIC]$ cat step1.out 

2024-07-02 15:30:29,526 - pyscenic.cli.pyscenic - INFO - Loading expression matrix.

2024-07-02 15:32:35,579 - pyscenic.cli.pyscenic - INFO - Inferring regulatory networks.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
Numba: Attempted to fork from a non-main thread, the TBB library may be in an invalid state in the child process.
2024-07-02 15:34:13,518 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:46641', 'tcp://127.0.0.1:43849', 'tcp://127.0.0.1:44125', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:37827']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:46641', 'tcp://127.0.0.1:43849', 'tcp://127.0.0.1:44125', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:37827']})
2024-07-02 15:34:13,518 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:46641', 'tcp://127.0.0.1:43849', 'tcp://127.0.0.1:44125', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:37827']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:46641', 'tcp://127.0.0.1:43849', 'tcp://127.0.0.1:44125', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:37827']})
2024-07-02 15:34:13,519 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:33897 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:46641', 'tcp://127.0.0.1:43849', 'tcp://127.0.0.1:44125', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:37827')}
2024-07-02 15:34:13,520 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:33279 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:46641', 'tcp://127.0.0.1:43849', 'tcp://127.0.0.1:44125', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:37827')}
2024-07-02 15:34:13,530 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:46641', 'tcp://127.0.0.1:43849', 'tcp://127.0.0.1:44125', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:37827']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:46641', 'tcp://127.0.0.1:43849', 'tcp://127.0.0.1:44125', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:37827']})
2024-07-02 15:34:13,531 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:33285 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:46641', 'tcp://127.0.0.1:43849', 'tcp://127.0.0.1:44125', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:37827')}
2024-07-02 15:34:28,077 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:32843', 'tcp://127.0.0.1:32855', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:45323', 'tcp://127.0.0.1:46849']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:32843', 'tcp://127.0.0.1:32855', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:45323', 'tcp://127.0.0.1:46849']})
2024-07-02 15:34:28,079 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:46641 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:32843', 'tcp://127.0.0.1:32855', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:45323', 'tcp://127.0.0.1:46849')}
2024-07-02 15:34:28,086 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:32843', 'tcp://127.0.0.1:32855', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:45323', 'tcp://127.0.0.1:46849']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:32843', 'tcp://127.0.0.1:32855', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:45323', 'tcp://127.0.0.1:46849']})
2024-07-02 15:34:28,088 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:33285 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:32843', 'tcp://127.0.0.1:32855', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:45323', 'tcp://127.0.0.1:46849')}
2024-07-02 15:37:24,939 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:32843', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:32843', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849']})
2024-07-02 15:37:24,942 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:32843', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:32843', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849']})
2024-07-02 15:37:24,943 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:32843', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:32843', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849']})
2024-07-02 15:37:24,972 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:44125 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:32843', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849')}
2024-07-02 15:37:24,973 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:43849 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:32843', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849')}
2024-07-02 15:37:24,973 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:42931 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:32843', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:42003', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849')}
2024-07-02 15:37:40,544 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:45323', 'tcp://127.0.0.1:44201', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33285']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:45323', 'tcp://127.0.0.1:44201', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33285']})
2024-07-02 15:37:40,545 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:42931 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:45323', 'tcp://127.0.0.1:44201', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33285')}
2024-07-02 15:37:40,562 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:45323', 'tcp://127.0.0.1:44201', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33285']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:45323', 'tcp://127.0.0.1:44201', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33285']})
2024-07-02 15:37:40,564 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:45323', 'tcp://127.0.0.1:44201', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33285']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:45323', 'tcp://127.0.0.1:44201', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33285']})
2024-07-02 15:37:40,564 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:44125 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:45323', 'tcp://127.0.0.1:44201', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33285')}
2024-07-02 15:37:40,565 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:37827 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:45323', 'tcp://127.0.0.1:44201', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33285')}
2024-07-02 15:39:52,768 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:46641', 'tcp://127.0.0.1:33285', 'tcp://127.0.0.1:37827', 'tcp://127.0.0.1:46849', 'tcp://127.0.0.1:42931']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:46641', 'tcp://127.0.0.1:33285', 'tcp://127.0.0.1:37827', 'tcp://127.0.0.1:46849', 'tcp://127.0.0.1:42931']})
2024-07-02 15:39:52,816 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:32855 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:46641', 'tcp://127.0.0.1:33285', 'tcp://127.0.0.1:37827', 'tcp://127.0.0.1:46849', 'tcp://127.0.0.1:42931')}
2024-07-02 15:40:37,290 - distributed.worker - WARNING - Could not find data: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33897', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849']} on workers: [] (who_has: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ['tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33897', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849']})
2024-07-02 15:40:37,291 - distributed.scheduler - WARNING - Worker tcp://127.0.0.1:44201 failed to acquire keys: {'ndarray-a612cf0abd06497fa68e9db39636fedb': ('tcp://127.0.0.1:35067', 'tcp://127.0.0.1:33897', 'tcp://127.0.0.1:33279', 'tcp://127.0.0.1:46641', 'tcp://127.0.0.1:44847', 'tcp://127.0.0.1:46849')}

Expected behavior I haven't found a clear way to reproduce the problem, but I find that it often appears after a run.
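
For what it's worth, the repeated Numba warning above ("Attempted to fork from a non-main thread, the TBB library may be in an invalid state") points at Numba having selected the TBB threading layer before dask's worker processes were forked. A minimal, untested workaround sketch (the loom/TF-list/output paths below are placeholders, not the reporter's actual files) is to force a fork-safe Numba threading layer before launching the GRN step:

# Hypothetical workaround, not a confirmed fix: steer Numba away from the TBB
# threading layer so forked dask workers do not inherit TBB state, and keep
# BLAS/OpenMP from spawning extra threads per worker.
export NUMBA_THREADING_LAYER=workqueue
export OMP_NUM_THREADS=1

# expr.loom, tfs.txt and out_grn.tsv are placeholder paths.
pyscenic grn expr.loom tfs.txt \
    --method grnboost2 \
    --num_workers 16 \
    --output out_grn.tsv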

zhangdong360 commented 4 days ago

My code:

#!/bin/bash
#SBATCH -o output/pyscenic_hsc_sev.out
#SBATCH -e output/pyscenic_hsc_sev.err
#SBATCH --partition=compute
#SBATCH -J scenic_HSC_SEV
#SBATCH --nodes=1               
#SBATCH -n 30
# This is for fastp protocol

#conda activate scanpy
# human
#f_db_names="/share/home/zhangd/tools/database/cistarget/cisTarget_databases/homo_sapiens/hg38/refseq_r80/mc_v10_clust/gene_based/hg38_500bp_up_100bp_down_full_tx_v10_clust.genes_vs_motifs.rankings.feather"
#f_motif_path="/share/home/zhangd/tools/database/cistarget/Motif2TF/motifs-v10nr_clust-nr.hgnc-m0.001-o0.0.tbl"
#f_tf_list="/share/home/zhangd/project/python_project/pySCENIC/allTFs_hg38.txt"
# mouse
f_db_names="/home/zhangdong_2/database/cistarget/cisTarget_databases/mus_musculus/mm10/refseq_r80/mc_v10_clust/mm10_500bp_up_100bp_down_full_tx_v10_clust.genes_vs_motifs.rankings.feather"
f_motif_path="/home/zhangdong_2/database/cistarget/Motif2TF/motifs-v10nr_clust-nr.mgi-m0.001-o0.0.tbl"
f_tf_list="/home/zhangdong_2/database/cistarget/TF_lists/allTFs_mm.txt"
# data input

dir_result="/home/zhangdong_2/project/pySCENIC/03_result/HSC_SEV/"
input_loom="/home/zhangdong_2/project/pySCENIC/01_data/HSC_SEV.loom"

# step1
echo "Step 1 pyscenic grn start"
nohup pyscenic grn ${input_loom}  ${f_tf_list} \
              --seed 21 \
              --num_workers 16 \
              --method grnboost2 \
              --output ${dir_result}/step_1_fibo_grn.tsv >step1.out 2>&1 &
echo "Step 1 pyscenic grn finish"
echo "Step 2 pyscenic ctx start"
nohup pyscenic ctx ${dir_result}/step_1_fibo_grn.tsv  \
     ${f_db_names} \
     --annotations_fname ${f_motif_path} \
     --expression_mtx_fname ${input_loom} \
     --output ${dir_result}/step_2_reg.csv \
     --mask_dropouts \
     --num_workers 16 >step2.out 2>&1 &
echo "Step 2 pyscenic ctx finish"
echo "Step 3 pyscenic aucell start"
pyscenic aucell \
    ${input_loom} \
    ${dir_result}/step_2_reg.csv \
    --seed 21 \
    --output ${dir_result}/step_3_aucell.csv \
    --num_workers 16 >step_3.out 2>&1 &
echo "All finish"

I have tried both submitting to compute nodes via SLURM and running directly in a local bash session, and the result is the same. I also ran this code on another platform, and the warnings were almost identical, all pointing to ndarray-a612cf0abd06497fa68e9db39636fedb. Maybe that helps to locate the problem?
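
(Side note on the script above: every step is launched with nohup … &, so pyscenic ctx and pyscenic aucell are started before pyscenic grn has written step_1_fibo_grn.tsv. A sequential sketch of the same three steps, untested and reusing the variables defined above, would be:)

# Same commands as above, but run one after another (no nohup/&) so each
# step only starts once the previous step's output file exists.
pyscenic grn ${input_loom} ${f_tf_list} \
    --seed 21 \
    --num_workers 16 \
    --method grnboost2 \
    --output ${dir_result}/step_1_fibo_grn.tsv > step1.out 2>&1

pyscenic ctx ${dir_result}/step_1_fibo_grn.tsv ${f_db_names} \
    --annotations_fname ${f_motif_path} \
    --expression_mtx_fname ${input_loom} \
    --output ${dir_result}/step_2_reg.csv \
    --mask_dropouts \
    --num_workers 16 > step2.out 2>&1

pyscenic aucell ${input_loom} ${dir_result}/step_2_reg.csv \
    --seed 21 \
    --output ${dir_result}/step_3_aucell.csv \
    --num_workers 16 > step_3.out 2>&1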

zhangdong360 commented 4 days ago

NOTE: It is worth mentioning that different data on different platforms produce the same warning… and my downsampled dataset runs successfully. This has been gnawing at me for a long time.

ghuls commented 3 days ago

Can you try with the Docker/Podman/Singularity/Apptainer images instead? https://pyscenic.readthedocs.io/en/latest/installation.html#docker-podman-and-singularity-apptainer-images
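
For reference, a minimal sketch of what that might look like with Docker (the image tag and mounted paths below are assumptions; the linked installation page has the exact, current commands):

# Assumed tag; check Docker Hub (aertslab/pyscenic) or the docs for the latest release.
docker pull aertslab/pyscenic:0.12.1

# Run the grn step inside the container, mounting a host directory
# (placeholder /path/to/data) that contains the loom file and TF list.
docker run --rm -v /path/to/data:/data aertslab/pyscenic:0.12.1 \
    pyscenic grn /data/HSC_SEV.loom /data/allTFs_mm.txt \
        --method grnboost2 \
        --num_workers 16 \
        --output /data/step_1_fibo_grn.tsv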