Kennedy-Lab-UW / Duplex-Seq-Pipeline

A standalone end-to-end data analysis pipeline for Duplex Sequencing

Implement consensus_caller speedup #96

Closed bkohrn closed 2 years ago

bkohrn commented 3 years ago

THIS IS AN API-INCOMPATIBLE CHANGE

Implement a speedup to the consensus_caller in UnifiedConsensusMaker.py. It uses a combination of Counters and Cython to gain a 5x speedup in consensus_caller, which equates to a 2x speedup in the overall operation of UnifiedConsensusMaker.py, including UMI processing time. It also makes use of a pysam function to convert quality scores into strings more efficiently prior to write-out.

The change is API-incompatible due to the use of Cython in the consensus_caller speedup.
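For readers unfamiliar with the Counter approach, here is a minimal pure-Python sketch of what a Counter-based consensus call could look like. It is not the code from UnifiedConsensusMaker.py: the signature, the `cutoff` parameter, the use of "N" for ambiguous positions, and the assumption that all reads in a family have equal length are illustrative guesses, and the Cython part of the speedup is not shown.

```python
from collections import Counter


def consensus_caller(reads, cutoff=0.7):
    """Call a per-position consensus over a family of equal-length reads.

    Hypothetical signature for illustration only; the real function in
    UnifiedConsensusMaker.py may take different arguments.
    """
    if not reads:
        return ""
    n_reads = len(reads)
    consensus = []
    # zip(*reads) yields one column (a tuple of bases) per read position.
    for column in zip(*reads):
        # Counter tallies the column in one pass; most_common(1) returns
        # the most frequent base together with its count.
        base, count = Counter(column).most_common(1)[0]
        # Emit the base only if it reaches the agreement cutoff;
        # otherwise mark the position as ambiguous.
        consensus.append(base if count / n_reads >= cutoff else "N")
    return "".join(consensus)


family = ["ACGT", "ACGT", "ACGA"]
print(consensus_caller(family, cutoff=0.6))
print(consensus_caller(family, cutoff=0.7))
```

In this sketch the last position has two of three reads agreeing, so it is called at a cutoff of 0.6 but becomes "N" at 0.7. Counting each column with a single Counter avoids one counting pass per candidate base, which is presumably where part of the reported gain comes from.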