onecodex / finch-rs

A genomic minhashing implementation in Rust
https://www.onecodex.com
MIT License
93 stars 8 forks source link

Non-canonical kmers #1

Closed bovee closed 7 years ago

bovee commented 7 years ago

Add an option to allow sketching that doesn't perform canonicalization prior to sketching to allow e.g. adapter identification (adapters are extremely skewed in the number of counts they have in the forward versus reverse directions).

bovee commented 7 years ago

Although we're not exposing this in terms of saving sketches, we now do this behind the scenes for the strand-filter.