mcveanlab / mccortex

De novo genome assembly and multisample variant calling
https://github.com/mcveanlab/mccortex/wiki
MIT License

What does mccortex build do if it encounters Ns in the read? #52

Closed by yeban 7 years ago

yeban commented 7 years ago

Are k-mers containing N automatically ignored?

noporpoise commented 7 years ago

Yes, sequences are broken up at N bases.

For example, with k=3 the input AACNNCTA yields the k-mers AAC and CTA:

$ bin/mccortex31 build -q -k 3 --sample Test --seq <(echo 'AACNNCTA') - | bin/mccortex31 view -q --kmers -
AAC 1 ........
CTA 1 ........
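
The splitting behaviour described above can be sketched in a few lines of Python. This is only an illustration of the idea, not mccortex's actual implementation: the function name `kmers_split_at_n` is hypothetical, and the sketch ignores details such as reverse-complement handling and coverage counting. It breaks the read at runs of N and takes k-mers from each remaining fragment, so no k-mer ever spans an N.

```python
import re


def kmers_split_at_n(seq, k):
    """Return the k-mers of seq in order, never spanning an N base.

    Hypothetical helper for illustration; not part of mccortex.
    """
    kmers = []
    # Break the read into maximal runs of non-N bases.
    for fragment in re.split(r"[Nn]+", seq):
        # Fragments shorter than k contribute no k-mers (empty range).
        for i in range(len(fragment) - k + 1):
            kmers.append(fragment[i:i + k])
    return kmers


print(kmers_split_at_n("AACNNCTA", 3))  # prints ['AAC', 'CTA']
```

Note that a fragment shorter than k is dropped entirely under this model: for instance, `kmers_split_at_n("ACNGT", 3)` returns an empty list, because neither `AC` nor `GT` is long enough to hold a 3-mer.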

yeban commented 7 years ago

Thanks.