sourmash-bio / sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.
http://sourmash.readthedocs.io/en/latest/
Other
458 stars 78 forks source link

provide databases with standard metadata fields? #1847

Open ctb opened 2 years ago

ctb commented 2 years ago

like authors, citation DOI, original download URL, version, and description

bluegenes commented 2 years ago

+1000!

Will really help with db tracking, versioning, etc instead of having to rely on names.

ctb commented 2 years ago

idle thought: I wonder if we could/should use the hackmd approach of having standard YAML headers? e.g. a README.md file in a zipfile like this,

---
author:
version: 
DOI:
---

# general markdown information

this database was lovingly handcrafted by Tessa Pierce to meet the needs of modern researchers
ctb commented 2 years ago

or we could continue to avoid YAML and go with TOML or something 🤔

ctb commented 2 years ago

creation date!

ctb commented 2 years ago

related: https://github.com/sourmash-bio/sourmash/issues/1005