dib-lab / khmer

In-memory nucleotide sequence k-mer counting, filtering, graph traversal and more
http://khmer.readthedocs.io/

Try out forgetful Bloom filters #1198

Open ctb opened 9 years ago

ctb commented 9 years ago

Via @camillescott:

http://dprg.cs.uiuc.edu/docs/fbf_cac15/fbfpaper-2.pdf

ctb commented 9 years ago

One interesting random thought (not directly about FBF) is to adapt this idea, or cascading Bloom filters, to solve the memory pre-allocation problem. Briefly,

This could even speed things up quite a bit b/c of NUMA memory locality...
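The details of the idea are not spelled out above, so the following is only one possible reading, not what was proposed and not khmer code. It assumes the "pre-allocation problem" means having to pick a table size before the number of k-mers is known, and it sidesteps that by adding a larger Bloom filter layer whenever the newest layer fills up. That is closer to a scalable Bloom filter than to FBF or to cascading Bloom filters as published. The names `BloomFilter`, `GrowingBloomCascade`, and `max_occupancy` are hypothetical.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter over k-mer strings (illustrative only)."""

    def __init__(self, n_bits, n_hashes):
        self.n_bits = n_bits
        self.n_hashes = n_hashes
        self.bits = bytearray((n_bits + 7) // 8)
        self.n_set = 0  # number of bits currently set

    def _positions(self, kmer):
        # Derive n_hashes bit positions from one digest via double hashing.
        digest = hashlib.blake2b(kmer.encode(), digest_size=16).digest()
        h1 = int.from_bytes(digest[:8], "little")
        h2 = int.from_bytes(digest[8:], "little") | 1
        return [(h1 + i * h2) % self.n_bits for i in range(self.n_hashes)]

    def add(self, kmer):
        for pos in self._positions(kmer):
            byte, mask = pos >> 3, 1 << (pos & 7)
            if not self.bits[byte] & mask:
                self.bits[byte] |= mask
                self.n_set += 1

    def __contains__(self, kmer):
        return all(self.bits[p >> 3] & (1 << (p & 7))
                   for p in self._positions(kmer))

    def occupancy(self):
        return self.n_set / self.n_bits


class GrowingBloomCascade:
    """Assumed design: allocate a new, larger layer when the newest is full."""

    def __init__(self, n_bits=1024, n_hashes=4, max_occupancy=0.5):
        self.n_hashes = n_hashes
        self.max_occupancy = max_occupancy
        self.layers = [BloomFilter(n_bits, n_hashes)]

    def add(self, kmer):
        if kmer in self:
            return  # already present (or a false positive) in some layer
        newest = self.layers[-1]
        if newest.occupancy() >= self.max_occupancy:
            # Memory is allocated here, on demand, instead of up front.
            newest = BloomFilter(newest.n_bits * 2, self.n_hashes)
            self.layers.append(newest)
        newest.add(kmer)

    def __contains__(self, kmer):
        return any(kmer in layer for layer in self.layers)
```

The trade-off in this sketch is that a lookup may have to probe every layer, and the overall false positive rate grows with the number of layers, so it only illustrates the allocation pattern.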

ctb commented 7 years ago

Or we could try cuckoo filters, http://blog.fastforwardlabs.com/post/153566952648/probabilistic-data-structure-showdown-cuckoo
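For anyone unfamiliar with cuckoo filters, here is a minimal sketch of the general technique (partial-key cuckoo hashing with short fingerprints). It is not khmer code and not taken from the linked post; the class name, the 16-bit fingerprint size, and all parameters are assumptions chosen for illustration.

```python
import hashlib
import random


class CuckooFilter:
    """Toy cuckoo filter over k-mer strings (illustrative only)."""

    def __init__(self, n_buckets=1024, bucket_size=4, max_kicks=500, seed=0):
        # A power-of-two bucket count makes the XOR-based alternate index
        # its own inverse, so either bucket can recover the other.
        if n_buckets & (n_buckets - 1):
            raise ValueError("n_buckets must be a power of two")
        self.n_buckets = n_buckets
        self.bucket_size = bucket_size
        self.max_kicks = max_kicks
        self.buckets = [[] for _ in range(n_buckets)]
        self.rng = random.Random(seed)

    @staticmethod
    def _hash(data):
        return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "little")

    def _fingerprint(self, kmer):
        # 16-bit fingerprint; zero is mapped to 1 so it is never empty.
        return (self._hash(b"fp" + kmer.encode()) & 0xFFFF) or 1

    def _index(self, kmer):
        return self._hash(kmer.encode()) % self.n_buckets

    def _alt_index(self, index, fp):
        return (index ^ self._hash(fp.to_bytes(2, "little"))) % self.n_buckets

    def add(self, kmer):
        fp = self._fingerprint(kmer)
        i1 = self._index(kmer)
        i2 = self._alt_index(i1, fp)
        for i in (i1, i2):
            if len(self.buckets[i]) < self.bucket_size:
                self.buckets[i].append(fp)
                return True
        # Both buckets are full: evict a random resident and relocate it.
        i = self.rng.choice((i1, i2))
        for _ in range(self.max_kicks):
            slot = self.rng.randrange(self.bucket_size)
            fp, self.buckets[i][slot] = self.buckets[i][slot], fp
            i = self._alt_index(i, fp)
            if len(self.buckets[i]) < self.bucket_size:
                self.buckets[i].append(fp)
                return True
        # The last evicted fingerprint is dropped; treat False as "too full".
        return False

    def __contains__(self, kmer):
        fp = self._fingerprint(kmer)
        i1 = self._index(kmer)
        return fp in self.buckets[i1] or fp in self.buckets[self._alt_index(i1, fp)]

    def remove(self, kmer):
        fp = self._fingerprint(kmer)
        i1 = self._index(kmer)
        for i in (i1, self._alt_index(i1, fp)):
            if fp in self.buckets[i]:
                self.buckets[i].remove(fp)
                return True
        return False
```

Unlike a plain Bloom filter, this structure supports deletion, which is what makes it interesting next to the forgetful variants. Note that it only answers membership; it does not keep counts.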