Open ctb opened 2 years ago
What I wrote over in the genome-grist docs:
This file contains at least 8 columns, with the headers `ident` and `superkingdom`, `phylum`, `class`, `order`, `family`, `genus`, `species`.
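To make that concrete, here's a minimal sketch of a header check for such a file, using only the Python standard library. This is not sourmash's or genome-grist's own loader; the function name `missing_taxonomy_columns` and the example row are made up for illustration, and the only thing taken from the description above is the list of eight headers.

```python
import csv
import io

# The eight headers named in the genome-grist description above.
REQUIRED_COLUMNS = [
    "ident", "superkingdom", "phylum", "class",
    "order", "family", "genus", "species",
]


def missing_taxonomy_columns(csv_text):
    """Return the required headers that are absent from the CSV header row.

    Hypothetical helper for illustration only; extra columns are allowed,
    since the file is described as having "at least" these 8 columns.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    # fieldnames is None when the input has no header row at all.
    present = set(reader.fieldnames or [])
    return [col for col in REQUIRED_COLUMNS if col not in present]


# Illustrative row; the ident and lineage values are placeholders.
example = (
    "ident,superkingdom,phylum,class,order,family,genus,species\n"
    "genomeA,Bacteria,Proteobacteria,Gammaproteobacteria,"
    "Enterobacterales,Enterobacteriaceae,Escherichia,Escherichia coli\n"
)

if __name__ == "__main__":
    missing = missing_taxonomy_columns(example)
    if missing:
        print("missing columns:", ", ".join(missing))
    else:
        print("all required columns present")
```

A check like this only looks at the header row; it says nothing about whether the lineage values themselves are valid for any particular taxonomy.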
see also taxonkit info https://github.com/sourmash-bio/sourmash/issues/1851
this topic, plus discussions about the NCBI, GTDB, LINs, and ICTV taxonomies, could usefully go in either https://sourmash.readthedocs.io/en/latest/databases-advanced.html or https://sourmash.readthedocs.io/en/latest/sourmash-internals.html#taxonomy-and-assigning-lineages. There's already some stuff in the latter location!
right now it's not really specified anywhere 😆