sourmash-bio / sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.
http://sourmash.readthedocs.io/en/latest/

document max containment somewhere. #2224

Open ctb opened 2 years ago

ctb commented 2 years ago

per @ccbaumler https://github.com/sourmash-bio/sourmash/pull/2222#discussion_r949418693 we don't actually document max containment anywhere 😱

I can't think of a particularly good place to put it, either. We may need a new section; could be part of https://github.com/sourmash-bio/sourmash/pull/2184

and/or it may be time to add @bluegenes' beautiful pictures to the sourmash documentation somewhere 🤔

ctb commented 2 years ago

thinking -

ctb commented 2 years ago

more generally, we should describe the math for all of the various similarity calculations used:

as well as point out which ones are distance metrics (Jaccard similarity, angular similarity, and max containment; not sure about average containment).
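
For reference, the usual set-based definitions of the measures named in this comment can be written as below. This is a sketch under assumptions, not the project's documented math: $A$ and $B$ stand for the sets of hashes in two sketches, $\mathbf{a}$ and $\mathbf{b}$ are hypothetical abundance vectors over the same hashes, and the angular form (one minus the normalized angle between the vectors) is the common textbook definition, which may differ in detail from what sourmash computes.

```latex
\begin{align*}
J(A, B) &= \frac{|A \cap B|}{|A \cup B|}
  && \text{(Jaccard similarity)} \\
C(A, B) &= \frac{|A \cap B|}{|A|}
  && \text{(containment of $A$ in $B$)} \\
C_{\max}(A, B) &= \max\bigl(C(A, B),\, C(B, A)\bigr)
  = \frac{|A \cap B|}{\min(|A|, |B|)}
  && \text{(max containment)} \\
C_{\mathrm{avg}}(A, B) &= \frac{C(A, B) + C(B, A)}{2}
  && \text{(average containment)} \\
S_{\mathrm{ang}}(\mathbf{a}, \mathbf{b}) &= 1 - \frac{2}{\pi}
  \arccos\!\left(\frac{\mathbf{a} \cdot \mathbf{b}}
  {\lVert \mathbf{a} \rVert \, \lVert \mathbf{b} \rVert}\right)
  && \text{(angular similarity)}
\end{align*}
```

Note that $C(A, B)$ is asymmetric in its arguments, while $J$, $C_{\max}$, and $C_{\mathrm{avg}}$ are symmetric.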

jorondo1 commented 1 year ago

Hello Titus @ctb, I'm interested in this documentation! Especially in the difference between the containment metrics. Cheers!
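
To illustrate how the containment measures differ, here is a minimal sketch using plain Python sets as stand-ins for sketch hashes. The function names and the toy sets `small` and `large` are hypothetical, chosen only for illustration; this is not the sourmash API, and real sketches are subsampled hash sets rather than full sets.

```python
def jaccard(a: set, b: set) -> float:
    """Shared elements divided by all elements in either set."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def containment(a: set, b: set) -> float:
    """Fraction of a's elements that are also in b (asymmetric)."""
    return len(a & b) / len(a) if a else 0.0


def max_containment(a: set, b: set) -> float:
    """Larger of the two directional containments (symmetric)."""
    return max(containment(a, b), containment(b, a))


def avg_containment(a: set, b: set) -> float:
    """Mean of the two directional containments (symmetric)."""
    return (containment(a, b) + containment(b, a)) / 2


# Toy example: a small set that shares 2 elements with a larger one.
small = {1, 2, 3, 4}
large = {3, 4, 5, 6, 7, 8, 9, 10}

results = {
    "jaccard": jaccard(small, large),
    "small in large": containment(small, large),
    "large in small": containment(large, small),
    "max containment": max_containment(small, large),
    "avg containment": avg_containment(small, large),
}

for name, value in results.items():
    print(f"{name}: {value}")
```

With these toy sets the directional containments differ (0.5 versus 0.25) because the denominators are the sizes of different sets; max containment picks the larger value and average containment splits the difference. One consequence worth noting: max containment is 1.0 whenever one set is a subset of the other, so `1 - max_containment` can be 0 for two sets that are not identical.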