sheffield-bioinformatics-core / periscope

A tool to quantify sub-genomic RNA (sgRNA) expression in SARS-CoV-2 ARTIC Network amplicon nanopore sequencing data.
GNU General Public License v3.0

Move over vanguard optimisations to Illumina data #18

Closed by mattdmem 3 years ago

mattdmem commented 3 years ago

I implemented multi-threading in the nanopore version of periscope. This is achieved by splitting the original BAM file into parts (the number of parts is set by the user-specified number of threads). The parts are then processed in parallel.

Periscope seems to run slow (a few days using 1 or 2 threads) when there are more than 100 million reads and requires relatively large memory (>64gb). Do you have any suggestion for speeding up the analysis?

This user has noticed that periscope is pretty slow on very large Illumina datasets, so it would be good to migrate these changes over to the Illumina search functions!
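
For anyone picking this up, here is a minimal sketch of the split-and-run-in-parallel idea described above. It is not periscope's actual code: reads are plain strings rather than BAM records, and `split_into_parts`, `count_classes`, `run_parallel` and the "LEADER" check are all hypothetical stand-ins for illustration.

```python
from collections import Counter
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor


def split_into_parts(records, n_parts):
    """Split records into n_parts contiguous chunks of near-equal size."""
    if n_parts < 1:
        raise ValueError("n_parts must be at least 1")
    size, extra = divmod(len(records), n_parts)
    parts = []
    start = 0
    for i in range(n_parts):
        # The first `extra` chunks take one additional record each.
        end = start + size + (1 if i < extra else 0)
        parts.append(records[start:end])
        start = end
    return parts


def count_classes(part):
    """Hypothetical per-chunk work: label each read and tally the labels.

    Assumption: a read counts as sgRNA if it contains the placeholder
    string "LEADER". The real classification is more involved.
    """
    return Counter("sgRNA" if "LEADER" in read else "gRNA" for read in part)


def run_parallel(records, threads, executor_cls=ProcessPoolExecutor):
    """Process one chunk per worker and merge the per-chunk tallies."""
    parts = split_into_parts(records, threads)
    total = Counter()
    with executor_cls(max_workers=threads) as pool:
        for partial in pool.map(count_classes, parts):
            total.update(partial)
    return total


if __name__ == "__main__":
    reads = ["LEADER_a", "b", "c", "LEADER_d", "e"]
    print(run_parallel(reads, threads=2))
```

With real data each chunk would be a separate BAM file on disk and the merge step would combine per-chunk result tables, but the shape stays the same: split by thread count, process independently, merge. Passing `executor_cls=ThreadPoolExecutor` swaps in threads, which is handy for quick testing.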

mbdabrowska1 commented 3 years ago