medvedevgroup / TwoPaCo

A fast constructor of the compressed de Bruijn graph from many genomes

Handling N characters in sequences #6

Closed ChriKub closed 7 years ago

ChriKub commented 7 years ago

Hi,

When using input sequences that contain N characters (as almost all eukaryotic reference sequences do), I get this error message:

Round 0, 0:1048576 Pass Filling Filtering 1 error: Found an invalid character 'N'

Do I need to re-split my input into contigs to use it with TwoPaCo, or is this some kind of bug?

Thanks, Chris

ChriKub commented 7 years ago

Solved by updating to the current version and reinstalling.
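
For anyone who cannot update and still hits the error, the re-splitting workaround asked about above could look roughly like the sketch below. It is not part of TwoPaCo; the function names (`read_fasta`, `split_at_n`) and the `_part<i>_<offset>` naming scheme are made up for illustration. It assumes plain FASTA input and treats only `N`/`n` as invalid, so other ambiguity codes would need the same treatment if they are also rejected.

```python
import re
import sys


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def split_at_n(header, seq):
    """Yield (name, subsequence) for every maximal run without N/n.

    The name keeps the first word of the original header and appends a
    part index plus the 0-based start offset (hypothetical naming scheme).
    """
    name = header.split()[0] if header.strip() else "unnamed"
    for i, match in enumerate(re.finditer(r"[^Nn]+", seq)):
        yield f"{name}_part{i}_{match.start()}", match.group()


if __name__ == "__main__" and len(sys.argv) == 2:
    # Usage: python split_n.py input.fa > split.fa
    with open(sys.argv[1]) as handle:
        for header, seq in read_fasta(handle):
            for name, part in split_at_n(header, seq):
                print(f">{name}")
                print(part)
```

The start offset in each name makes it possible to map positions in the split pieces back to the original sequence afterwards. Sequences consisting only of N are dropped entirely.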