pySCENIC is a lightning-fast Python implementation of the SCENIC pipeline (Single-Cell rEgulatory Network Inference and Clustering), which enables biologists to infer transcription factors, gene regulatory networks and cell types from single-cell RNA-seq data.
Describe the bug
Hi all,
I keep getting this error: 'OverflowError: cannot serialize a bytes object larger than 4 GiB' when I run:
arboreto_with_multiprocessing.py \
sample.loom \
$tfs \
--method grnboost2 \
--output adj.sample.tsv \
--num_workers 40 \
--seed 777
Do you have any suggestions on how to fix this error?
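A possible way to narrow this down, offered as a sketch rather than a confirmed diagnosis: this message is raised by Python's pickle module when a single bytes payload larger than 4 GiB is serialized with a protocol below 4, and protocol 4 only became the default in Python 3.8. The helper names below (`dense_nbytes`, `exceeds_legacy_pickle_limit`, `default_protocol_handles_large_bytes`) and the example dimensions are hypothetical; substitute the real cell and gene counts from for.scenic.data.csv. Whether arboreto_with_multiprocessing.py actually sends the whole matrix to the workers in one piece is an assumption here.

```python
import pickle

import numpy as np

# Pickle protocols below 4 cannot store a single bytes object above this size.
FOUR_GIB = 4 * 1024 ** 3


def dense_nbytes(n_cells, n_genes, dtype=np.float32):
    """Bytes needed to hold a dense n_cells x n_genes matrix of `dtype`."""
    return int(n_cells) * int(n_genes) * np.dtype(dtype).itemsize


def exceeds_legacy_pickle_limit(n_cells, n_genes, dtype=np.float32):
    """True if the dense matrix would be larger than 4 GiB."""
    return dense_nbytes(n_cells, n_genes, dtype) > FOUR_GIB


def default_protocol_handles_large_bytes():
    """True if this interpreter's default pickle protocol is 4 or newer."""
    return pickle.DEFAULT_PROTOCOL >= 4


if __name__ == "__main__":
    # Hypothetical dimensions, not taken from the original report.
    n_cells, n_genes = 100_000, 30_000
    size_gib = dense_nbytes(n_cells, n_genes) / 1024 ** 3
    print(f"dense float32 matrix: {size_gib:.1f} GiB")
    print("over the 4 GiB limit:", exceeds_legacy_pickle_limit(n_cells, n_genes))
    print("default pickle protocol:", pickle.DEFAULT_PROTOCOL)
    print("handles large bytes:", default_protocol_handles_large_bytes())
```

If the estimate is above the limit and the default protocol is below 4, the first things to try would be a newer Python in the conda environment or a smaller matrix (fewer genes or cells). Neither is confirmed as the fix for this report.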
Note that most errors are due to the input from the user, and therefore should be treated as questions in the Discussions. Please only report them as bugs if you are quite certain that pySCENIC is not behaving as expected.
Steps to reproduce the behavior
Command run when the error occurred:
conda activate pyscenic
cd /home/data/t220416/Melanoma/3_pyscenic/result_AM
cat >change.py
import os,sys
os.getcwd()
os.listdir(os.getcwd())
import loompy as lp;
import numpy as np;
import scanpy as sc;
x=sc.read_csv("for.scenic.data.csv");
row_attrs = {"Gene": np.array(x.var_names),};
col_attrs = {"CellID": np.array(x.obs_names)};
lp.create("sample.loom",x.X.transpose(),row_attrs,col_attrs);
python change.py
cat >scenic.bash
dir=/home/data/t220416/Melanoma/3_pyscenic/0_data/index_genome/cisTarget_databases/hg38
tfs=$dir/hs_hgnc_tfs.txt
feather=$dir/hg38_10kbp_up_10kbp_down_full_tx_v10_clust.genes_vs_motifs.rankings.feather
tbl=$dir/motifs-v10nr_clust-nr.hgnc-m0.001-o0.0.tbl
input_loom=./sample.loom
ls $tfs $feather $tbl
arboreto_with_multiprocessing.py \
sample.loom \
$tfs \
--method grnboost2 \
--output adj.sample.tsv \
--num_workers 40 \
--seed 777
pyscenic ctx \
adj.sample.tsv $feather \
--annotations_fname $tbl \
--expression_mtx_fname $input_loom \
--mode "dask_multiprocessing" \
--output reg.csv \
--num_workers 20 \
--mask_dropouts
pyscenic aucell \
$input_loom \
reg.csv \
--output out_SCENIC.loom \
--num_workers 16
nohup bash scenic.bash 1>pySCENIC.log 2>&1 &
Error encountered:
Expected behavior A clear and concise description of what you expected to happen.
Please complete the following information:
[pip freeze, conda list, or skip this if using Docker/Singularity]: