dib-lab / sourmash-slainte

Project template for sourmash-based characterization of genomes and metagenomes
BSD 3-Clause "New" or "Revised" License
1 stars 0 forks source link

sketch data files individually first, then combine sketches? #7

Closed ctb closed 7 months ago

ctb commented 8 months ago

https://github.com/dib-lab/sourmash-slainte/pull/2 supports some flexibility in sketching multiple data files into a single sketch.

if we want to use manysketch from sourmash_plugin_branchwater, we either need to fix manysketch to allow multiple files -> one sketch, or we could instead first sketch individual data files and then combine them with sourmash sig merge. The latter would also allow us to do various kinds of diagnostics on the individual data files, like show containment and overlap stuff. Not sure how useful it is tho?

viz https://github.com/sourmash-bio/sourmash_plugin_branchwater/issues/169