Open Mira0507 opened 9 months ago
How much memory did you assign to the job? Try lowering the number of workers.
Interesting. I got it worked out after changing the number of cpus from 32 to 8. (memory was 100g)
Singularity> cat pyscenic_ctx_test.sh
#!/bin/bash
pyscenic ctx \
scrnaseq-pyscenic-tac1-chat/WT_all_adj.csv \
cistarget-db/mm10_500bp_up_100bp_down_full_tx_v10_clust.genes_vs_motifs.rankings.feather \
cistarget-db/mm10_10kbp_up_10kbp_down_full_tx_v10_clust.genes_vs_motifs.rankings.feather \
--annotations_fname cistarget-db/motifs-v10nr_clust-nr.mgi-m0.001-o0.0.tbl \
--expression_mtx_fname scrnaseq-pyscenic-tac1-chat/WT_all.loom \
-o scrnaseq-pyscenic-tac1-chat/WT_all_reg.csv \
--mask_dropouts \
--num_workers 8
ingularity> bash pyscenic_ctx_test.sh
(running...)
2024-02-14 15:44:04,062 - pyscenic.prune - INFO - Worker mm10_10kbp_up_10kbp_down_full_tx_v10_clust.genes_vs_motifs.rankings(3): All regulons derived.
2024-02-14 15:44:04,062 - pyscenic.prune - INFO - Worker mm10_10kbp_up_10kbp_down_full_tx_v10_clust.genes_vs_motifs.rankings(3): All regulons derived.
2024-02-14 15:44:04,081 - pyscenic.prune - INFO - Worker mm10_10kbp_up_10kbp_down_full_tx_v10_clust.genes_vs_motifs.rankings(3): Done.
2024-02-14 15:44:04,081 - pyscenic.prune - INFO - Worker mm10_10kbp_up_10kbp_down_full_tx_v10_clust.genes_vs_motifs.rankings(3): Done.
2024-02-14 15:44:04,147 - pyscenic.cli.pyscenic - INFO - Writing results to file.
$ la | grep WT_all_reg.csv
-rw-rw----. 1 username xxxx 2.5M Feb 14 15:44 WT_all_reg.csv
I've never thought about more cpus causing problems. Do you have any guess about this situation?
Thank you so much for the discussion, @ghuls!
Each worker loads the databases again, so you need memory for this.
I've been seeing my
pyscenic ctx
gets terminated in the middle of the run. My mystery is that it doesn't throw any error message as shown below:It comes back to my terminal without the
Writing an output file
message, which I'm expecting to see if it was successful.My script is following:
This run ended up creating an empty (zero byte) output file shown here:
I'm assuming that my issue is unrelated to unmatched gene symbols across the input files as I found that over 1000 gene symbols intersected across the ranking (
.feather
), adjacency (adj.csv
), TFs (allTFs_mm.txt
), and motif (.tbl
) files.Some testings done so far:
--no_pruning
parameter--mask_dropouts
parametermm10__refseq-r80__10kb_up_and_down_tss.mc9nr.genes_vs_motifs.rankings.feather
,mm10__refseq-r80__500bp_up_and_100bp_down_tss.mc9nr.genes_vs_motifs.rankings.feather
) and/or v9 motif file (motifs-v9-nr.mgi-m0.001-o0.0.tbl
)I saw a few issue reports on empty output but at least they got
Writing an output file
message.My environment was HPC being summarized here:
Single-Cell rEgulatory Network Inference and Clustering (0.12.1+0.gce41b61.dirty)