Reference database augmentation
Easy to access and well curated reference databases empower open-source research and help scale the
impact of microbiome research. NMDC will partner with the SeqCode Initiative (see Letter of Support) to
expand the current list of bacterial and archaeal taxa by incorporating data inferred by genome sequences
and providing new taxonomic names to microbes that have yet to be cultured but whose genomes have
been interrogated. To provide more taxonomic context for microbiomes under study, and to support
broad community adoption of the SeqCode taxonomy and nomenclature system, the NMDC will work
directly with the SeqCode development team to outline a plan to update our sequence-based reference
genome and taxonomy databases (Milestone 2.21). Together with the proposed new viral workflow
above, this effort will enable NMDC metagenomes and metatranscriptomes to be annotated with the
current SeqCode and ICTV taxonomies, and aligns with development activities to update the Pilot
metagenome workflow to the current JGI-production version (see An updated metagenome annotation
workflow). Our development teams will explore the feasibility of sharing metagenome bin data which
passes SeqCode criteria for taxonomic registration76, to directly support and expand the SeqCode
taxonomy with new data that will be processed and hosted in the NMDC Data Portal (Milestone 2.22).
Reference database augmentation Easy to access and well curated reference databases empower open-source research and help scale the impact of microbiome research. NMDC will partner with the SeqCode Initiative (see Letter of Support) to expand the current list of bacterial and archaeal taxa by incorporating data inferred by genome sequences and providing new taxonomic names to microbes that have yet to be cultured but whose genomes have been interrogated. To provide more taxonomic context for microbiomes under study, and to support broad community adoption of the SeqCode taxonomy and nomenclature system, the NMDC will work directly with the SeqCode development team to outline a plan to update our sequence-based reference genome and taxonomy databases (Milestone 2.21). Together with the proposed new viral workflow above, this effort will enable NMDC metagenomes and metatranscriptomes to be annotated with the current SeqCode and ICTV taxonomies, and aligns with development activities to update the Pilot metagenome workflow to the current JGI-production version (see An updated metagenome annotation workflow). Our development teams will explore the feasibility of sharing metagenome bin data which passes SeqCode criteria for taxonomic registration76, to directly support and expand the SeqCode taxonomy with new data that will be processed and hosted in the NMDC Data Portal (Milestone 2.22).
Page 34
See #460 #461 #462