Closed ddpinto closed 4 years ago
1) Is your sample targeted?
2) Running 8x 8M cells is really heavy.
3) Are you starting with HiFi or unpolished CCS? Maybe try running CCS with --min-passes 2 --min-rq 0.9
and then feed that into the Iso-Seq pipeline.
Samples are not targeted. I am starting from HiFi reads and running CCS with --min-passes 3 --min-rq 0.9
. I could share datafiles or core dump (though it is very big) for troubleshooting.
I suspect that you simply run out of memory. We never ran more than 3-4 8M cells at once, so far. The only thing you can do for now, is use a high-memory compute server. Reducing the memory footprint is not an easy task and so far you are the first one to report issues. If that solution leaves you unhappy, feel free to report an issue with PacBio's tech support, but even if we were to implement changes, it won't be near term.
I am already running on a 1 Tb node but when the crash occurs the process is using "only" 400 Gb.
Operating system Linux
Package name IsoSeq3 version 3.3.0
Describe the bug When running isoseq3 cluster with data from 8 2M sequel2 SMRT cells it consistently results in segmentation faults and core dumps. Running cells individually, or in e.g. sets of 4 works well, but combining them in larger sets results in crashes. At the time of the crashes the cluster job is using >400 Gb of RAM and the core dumps are of similar size.
Error message line 23: 14606 Segmentation fault (core dumped)
Conda environment