Open ctb opened 2 years ago
thinking -
more generally, we should describe the math for all of the various similarity calculations used:
as well as point out which ones are distance metrics (jaccard similarity, angular similarity, and max containment; not sure about average containment).
Hello Titus @ctb, I'm interested in this documentation! Especially in the difference between the containment metrics. Cheers!
per @ccbaumler https://github.com/sourmash-bio/sourmash/pull/2222#discussion_r949418693 we don't actually document max containment anywhere 😱
I can't think of a particularly good place to put it, either. We may need a new section; could be part of https://github.com/sourmash-bio/sourmash/pull/2184
and/or it may be time to add @bluegenes beautiful pictures into the sourmash documentation somewhere 🤔