gui11aume / starcode

All pairs search and sequence clustering
GNU General Public License v3.0
90 stars 21 forks source link

Lookup search bug #7

Closed ezorita closed 10 years ago

ezorita commented 10 years ago

I found a bug in the lookup-table search algorithm for specific sequences. The error is systematic for some sequences that have exactly tau insertions, e.g.:

canonical: CCGGCCCGCGACGAGATCTGGGCCAACCTAAGAGCCGCTAAGCCCCGGCA mutated: CCGGCCCGCGACGAGATCTGGGCCAACCTAAGAGCCGCGTAGAGCCCCGGCCA c: AACGCGCTAACCGGTGCACTAAAGCGAGACTAGTGAGATTACGACACAGA m: AACGCGCTAACCGAGTGCACTAAAGCGAGAGCTAGTGAGACTTACGACACAGA c: CACTTCTAGAAGGCGCTCATATCACGCCTGGACCGATGTGTTGTTTAGAT m: CACTTCTGAGAAGGCGCTCATATCACGCCTGGGACCGATGTGTTGTTTGAGA

All these matches are identified when lookup_search() is disabled.