cnobles / iGUIDE

Bioinformatic pipeline for identifying dsDNA breaks by marker based incorporation, such as breaks induced by designer nucleases like Cas9.
https://iguide.readthedocs.io/en/latest/
GNU General Public License v3.0

Memory explosion #84

Closed ShanSabri closed 3 years ago

ShanSabri commented 3 years ago

Hi @cnobles - I'm trying to process 12 samples totaling 100M reads on an r5d.16xlarge EC2 instance (64 cores, 512 GiB). The instance exceeds its maximum memory during the demux rule and the job is ultimately killed. This seems a bit crazy to me. Any ideas on how to get these samples to pass through?


ShanSabri commented 3 years ago

As a workaround, I've split the run by i7 index into 6 bins, each holding a pair of samples that share an i7 index, and am processing these bins independently.
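
For anyone hitting the same limit, here is a minimal sketch of this kind of i7 binning. It is not the script used for this run and not part of iGUIDE. It assumes uncompressed or gzipped FASTQ files that list reads in the same order, exact matching of the i7 sequence, and hypothetical names throughout (`bin_by_i7`, the `I1`/`R1` labels, the bin names). Records are streamed one at a time, so memory use stays flat regardless of read count.

```python
import gzip
import os
from contextlib import ExitStack
from itertools import islice


def read_fastq(handle):
    """Yield one FASTQ record (a list of 4 lines) at a time."""
    while True:
        record = list(islice(handle, 4))
        if not record:
            return
        if len(record) != 4:
            raise ValueError("truncated FASTQ record")
        yield record


def open_text(path, mode):
    """Open a plain or gzip-compressed file in text mode, chosen by suffix."""
    if str(path).endswith(".gz"):
        return gzip.open(path, mode + "t")
    return open(path, mode)


def bin_by_i7(paths, i7_label, i7_to_bin, out_dir):
    """Stream paired FASTQ files into per-bin files keyed on the i7 read.

    paths:     dict of label -> FASTQ path (e.g. "I1", "R1"); hypothetical labels
    i7_label:  which label in `paths` holds the i7 index reads
    i7_to_bin: dict of i7 sequence -> bin name (exact match, an assumption)
    Returns a dict of bin name -> number of reads written.
    """
    os.makedirs(out_dir, exist_ok=True)
    counts = {}
    with ExitStack() as stack:
        readers = {
            label: read_fastq(stack.enter_context(open_text(path, "r")))
            for label, path in paths.items()
        }
        outputs = {}
        for i7_record in readers[i7_label]:
            # Pull the matching record from every other file in lockstep.
            records = {i7_label: i7_record}
            for label, reader in readers.items():
                if label == i7_label:
                    continue
                rec = next(reader, None)
                if rec is None:
                    raise ValueError(f"{label} has fewer reads than {i7_label}")
                records[label] = rec
            bin_name = i7_to_bin.get(i7_record[1].strip())
            if bin_name is None:
                continue  # unknown index: read is dropped
            for label, rec in records.items():
                key = (bin_name, label)
                if key not in outputs:
                    out_path = os.path.join(out_dir, f"{bin_name}_{label}.fastq")
                    outputs[key] = stack.enter_context(open(out_path, "w"))
                outputs[key].writelines(rec)
            counts[bin_name] = counts.get(bin_name, 0) + 1
    return counts
```

Each bin's output files can then be run through the pipeline as a separate, smaller job. Reads whose i7 sequence is not in `i7_to_bin` are dropped, so a mismatch-tolerant lookup would be needed to keep reads with index errors.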