sourmash-bio / sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.
http://sourmash.readthedocs.io/en/latest/
Other
473 stars 80 forks source link

revisit `sourmash sig describe` CSV output #1854

Open ctb opened 2 years ago

ctb commented 2 years ago

compare/contrast with manifests and esp output formats from sourmash sig manifest - https://github.com/sourmash-bio/sourmash/pull/1837

ctb commented 2 years ago

sig describe contains all of the manifest fields but two - it doesn't contain internal_location or md5short.

it also contains the extra fields signature_file, seed, and license - we want to add the latter two in #1849 anyway :).

I think there are some interesting potential crossovers with manifests of manifests https://github.com/sourmash-bio/sourmash/issues/1685, where sig describe is producing a format that could be used there.

ctb commented 2 years ago

maybe a goal of this and sig manifest could be to produce / update manifests of manifests? And we could support multiple formats, e.g. both CSV and sqlite.

ctb commented 2 years ago

https://github.com/sourmash-bio/sourmash/pull/1882 adds sum_hashes