dkoslicki / CMash

Fast and accurate set similarity estimation via containment min hash
BSD 3-Clause "New" or "Revised" License
42 stars 9 forks source link

Bloom filter for pre-filter of kmers #4

Closed dkoslicki closed 6 years ago

dkoslicki commented 6 years ago

Try creating a pre-filter consisting of all kmers (for all kmer sizes) of all the hashes and their prefixes, so I don't have to hit the trie so frequently.