edilytics / CRISPResso2

Analysis of deep sequencing data for rapid and intuitive interpretation of genome editing experiments
Other
0 stars 1 forks source link

Fastq Alignment Parallelization #90

Closed trevormartinj7 closed 2 months ago

trevormartinj7 commented 3 months ago

This project splits the up the work of creating multiple variant objects for .fastq files across multiple processes, greatly speeding up processing time for processing of all .fastq files.

It uses python's multiprocessing library to spin up processes which then create variant objects in parallel. It then stores them in intermediate .tsv files, which are read to track statistics of the files.