onecodex / finch-rs

A genomic minhashing implementation in Rust
https://www.onecodex.com
MIT License
92 stars 8 forks source link

Allow different sketches in the same multisketch to have different kmer sizes #27

Open bovee opened 5 years ago

bovee commented 5 years ago

This would require some kind of extension to how we store sketches (neither the Mash binary format nor the JSON format support "Multisketches" with different sketching parameters). This is mostly useful for sketching algorithms that can use sketches of different kmer sizes to impute things about the core vs accessory genomic distance (i.e. see https://github.com/johnlees/PopPUNK), but it's possible there are other uses.