Closed IvantheDugtrio closed 6 years ago
I reran the samples with an intersected bed file for my panel and with 3 threads used. So far it's running much faster and has no such memory bottlenecks.
Also is there any notable difference between running this with Python3 versus Python2.7? Our compute cluster runs CentOS 7 which has been tricky to get Python3 up and running on.
There should be no notable difference between Python 2.7 vs Python 3. However, I must reiterate that comparing mismatched tumor and normal samples is very likely to produce a high rate of false-positive MSI calls.
Hello @IvantheDugtrio, do you have any further questions regarding this issue?
Hi @rbonneville, nah this seems to have been worked out. I'm closing this now.
I noticed my processes would start failing after it finished running kmer_repeat_counter.py for several hours because mantis.py would consume all of the available system memory plus the entire swapfile. I ran this on a server with 88 processing threads and 192GB of RAM. I ran with the --threads 88 option.
I am analyzing targeted sequencing reads covering about 0.9 Mb, all CDS. I am also using a pool of normals as my normal control since we don't have the normal tissue for these samples.
Also I generated my loci.bed file from the GRCh37.fa reference genome.
How much RAM does mantis.py need per thread?