dnbaker / dashing

Fast and accurate genomic distances using HyperLogLog
GNU General Public License v3.0
161 stars 11 forks source link

Add protein k-mers with 6-frame translation #11

Open olgabot opened 5 years ago

olgabot commented 5 years ago

Hello, I've been using protein k-mers from sourmash to compare single-cell RNA-seq profiles across different species. I'm curious to benchmark dashing for this purpose as well. Warmest, Olga

dnbaker commented 5 years ago

It’d be useful and worth consideration. Which encoding scheme(s) to support and doing so efficiently (IE, comparably to our nucleic acid support) wouldn’t necessarily be trivial, but we’ve looked into it and will be considering this enhancement.