refresh-bio / KMC

Fast and frugal disk based k-mer counter

drop kmers that only appear at one position in reads #78

Open notestaff opened 6 years ago

notestaff commented 6 years ago

Sometimes a k-mer appears in several different reads, so it looks real, but all of those reads are duplicates of one another. It would be good to have an option to drop k-mers that only ever appear at one particular position within a read. Or, more generally, to keep only k-mers for which the difference between the minimum and maximum position of the k-mer within a read, taken over all reads containing it, is at least N. The first case is the same as N = 1.
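
To make the request concrete, here is a minimal sketch of the filter in plain Python. It is not KMC code and does not use any KMC API. The function name `filter_kmers_by_position_spread` and its parameters are hypothetical, and it assumes reads are plain strings, positions are 0-based offsets within a read, and reverse complements are ignored.

```python
def filter_kmers_by_position_spread(reads, k, min_spread):
    """Count k-mers, keeping only those whose offsets span at least min_spread.

    Hypothetical helper for illustration; not part of KMC.
    """
    # kmer -> [count, smallest offset seen, largest offset seen]
    stats = {}
    for read in reads:
        for offset in range(len(read) - k + 1):
            kmer = read[offset:offset + k]
            entry = stats.get(kmer)
            if entry is None:
                stats[kmer] = [1, offset, offset]
            else:
                entry[0] += 1
                entry[1] = min(entry[1], offset)
                entry[2] = max(entry[2], offset)
    # Keep a k-mer only if (max offset - min offset) >= min_spread.
    return {
        kmer: count
        for kmer, (count, lo, hi) in stats.items()
        if hi - lo >= min_spread
    }


if __name__ == "__main__":
    # Two duplicate reads plus one read shifted by a single base.
    reads = ["ACGTA", "ACGTA", "CGTAC"]
    print(filter_kmers_by_position_spread(reads, k=3, min_spread=1))
```

In this toy input, ACG occurs twice but only at offset 0 of the duplicated read, so it is dropped, while CGT and GTA are also seen at a different offset in the third read and are kept. Tracking just a minimum and maximum offset per k-mer would add a small fixed amount of state to each counter, which is presumably what a built-in option would need to store.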