kamimrcht / REINDEER

REINDEER REad Index for abuNDancE quERy
GNU Affero General Public License v3.0
56 stars 6 forks source link

Skip kmers with N instead of skipping entire sequences with N #27

Open tbenavi1 opened 2 weeks ago

tbenavi1 commented 2 weeks ago

Hello,

Currently Reindeer will skip any queried sequence with Ns. However, I was wondering if would instead be possible to only skip any kmers with Ns (or actually, to output a count of 0 for any such kmers, in order to maintain the positions in the original query sequence). For example, bcalm can run on sequences with N's and it will simply skip any kmers that overlap N. Thanks for your consideration.