If we want to use `manysketch` from sourmash_plugin_branchwater, we either need to fix `manysketch` to allow multiple files -> one sketch, or we could instead sketch the individual data files first and then combine them with `sourmash sig merge`. The latter would also let us run various diagnostics on the individual data files, such as showing containment and overlap between them. Not sure how useful that is, though?
https://github.com/dib-lab/sourmash-slainte/pull/2 supports some flexibility in sketching multiple data files into a single sketch.
viz https://github.com/sourmash-bio/sourmash_plugin_branchwater/issues/169
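To make the "sketch per file, then merge" option concrete, here is a toy illustration in plain Python. It is not sourmash code: `toy_sketch`, `containment`, and the example sequences are hypothetical stand-ins, the hash is BLAKE2b rather than whatever sourmash uses, and reverse complements are ignored. It only assumes that a sketch can be treated as a set of hashes that fall below a scaled threshold, in which case merging per-file sketches is a set union and should match sketching all the files together, while the per-file sketches stay available for containment/overlap checks.

```python
import hashlib

MAX_HASH = 2**64  # size of the 64-bit hash space used by this toy example


def kmers(seq, ksize):
    """Yield every overlapping k-mer of length ksize in seq."""
    seq = seq.upper()
    return (seq[i:i + ksize] for i in range(len(seq) - ksize + 1))


def toy_sketch(seqs, ksize=5, scaled=2):
    """Toy scaled sketch: keep hashes in the lowest 1/scaled of hash space.

    Hypothetical helper for illustration only, not the sourmash API.
    """
    threshold = MAX_HASH // scaled
    hashes = set()
    for seq in seqs:
        for kmer in kmers(seq, ksize):
            digest = hashlib.blake2b(kmer.encode(), digest_size=8).digest()
            h = int.from_bytes(digest, "big")
            if h < threshold:
                hashes.add(h)
    return hashes


def containment(a, b):
    """Fraction of the hashes in sketch a that are also in sketch b."""
    return len(a & b) / len(a) if a else 0.0


# Two made-up "data files", each a list of sequences.
file_a = ["ACGTACGTTGCAAGCT"]
file_b = ["TTGCAAGCTGGATCCA"]

# Option 2 from the comment: sketch each file, then merge the sketches.
sk_a = toy_sketch(file_a)
sk_b = toy_sketch(file_b)
merged = sk_a | sk_b

# Option 1 from the comment: multiple files -> one sketch directly.
combined = toy_sketch(file_a + file_b)

# Per-file diagnostics that only option 2 keeps around.
print("merged == combined:", merged == combined)
print("containment of a in b:", containment(sk_a, sk_b))
print("shared hashes:", len(sk_a & sk_b))
```

In this toy model the two routes give the same final sketch, so the only real difference is that the per-file route leaves the individual sketches around for diagnostics. Whether that holds exactly for real signatures depends on the sketches sharing the same ksize and scaled parameters.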