ShawHahnLab / umbra

Python package and executable for Linux for managing Illumina sequencing runs
GNU Affero General Public License v3.0

Contig assembly task mis-numbers contigs #98

Closed ressy closed 4 years ago

ressy commented 4 years ago

In TaskAssemble.prep_contigs_for_geneious, the contig numbering is mangled by an incorrect regular expression. re.match("^NODE_([0-9])+_.*", rec.id) should be re.match("^NODE_([0-9]+)_.*", rec.id), with the capture group including all of the digits. As it currently stands, the quantifier sits outside the group, so the group captures only the last digit of each contig number and the same one-digit labels are recycled over and over.
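
A minimal sketch of the difference between the two patterns, using only the standard library. The record IDs and the contig_number helper are hypothetical, made up for illustration; they are not taken from umbra's code or from real assembler output.

```python
import re

# Quantifier outside the group: the group is re-captured for every digit,
# and only the last repetition is kept.
BUGGY = re.compile(r"^NODE_([0-9])+_.*")
# Quantifier inside the group: the group captures the whole number.
FIXED = re.compile(r"^NODE_([0-9]+)_.*")


def contig_number(pattern, rec_id):
    """Return the captured contig number as a string, or None if no match."""
    match = pattern.match(rec_id)
    return match.group(1) if match else None


# Hypothetical record IDs, for illustration only.
ids = [
    "NODE_1_length_500_cov_10.0",
    "NODE_11_length_400_cov_8.5",
    "NODE_21_length_300_cov_7.2",
]

buggy = [contig_number(BUGGY, rec_id) for rec_id in ids]
fixed = [contig_number(FIXED, rec_id) for rec_id in ids]

print(buggy)  # ['1', '1', '1']
print(fixed)  # ['1', '11', '21']
```

With the buggy pattern, contigs 1, 11, and 21 all end up labeled "1", which matches the recycled one-digit labels described above.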