Open alimayy opened 6 years ago
Would it be possible to split the las file and run HINGE with --mlas?
Hi Govinda, it seems to have worked. I'll let you know once I have done a more thorough evaluation. By the way, I also noticed the huge number of las files that are produced during
HPC.daligner -t5 -T32 hinge| csh -v > /dev/null 2>&
(see attached)
I have the feeling that this causes the pipeline to take too long. Would you agree?
Hi guys,
We recently realised that the way we run the HINGE pipeline causes a process in the chain to be killed for large PacBio and ONT datasets. I think this is related to issue 130, where a large, single las file is read into memory. The dataset and las file properties are as follows:
PacBio dataset: 1,282,848 reads, 6.4 Gb yield; hinge.las file size: 138Gb
ONT dataset: 756,656 reads, 6.4 Gb yield; hinge.las (gzipped) file size: 104G
I'm pasting the log from our wrapper and the error for the PacBio dataset (the error for the ONT dataset was the same). What should I add in the pipeline to prevent this issue?
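If the kill really does come from a single las file being read into memory, a quick pre-flight check in the wrapper could flag the problem before the step starts. The following is only a minimal sketch under that assumption: `physical_memory_bytes`, `fits_in_memory` and the `headroom` factor are hypothetical names and values invented for illustration, not part of HINGE, and the RAM lookup assumes a POSIX system such as Linux.

```python
import os


def physical_memory_bytes():
    """Total physical RAM in bytes (POSIX only, read via sysconf)."""
    return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")


def fits_in_memory(path, memory_bytes=None, headroom=0.8):
    """Return True if the file at `path` is no larger than headroom * RAM.

    `headroom` is an arbitrary safety factor (an assumption, not a HINGE
    setting) that leaves room for the rest of the process.
    """
    if memory_bytes is None:
        memory_bytes = physical_memory_bytes()
    return os.path.getsize(path) <= headroom * memory_bytes
```

For example, `fits_in_memory("hinge.las")` returning False would suggest splitting the file before running the step. For a gzipped las file the on-disk size understates the uncompressed size, so the check would be too optimistic there.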