Ecogenomics / GTDBNCBI

The GTDB provides the software infrastructure for working with a large collection of genomic resources. The major goal of this initiative is to provide a phylogenetically consistent taxonomy for archaea and bacteria.
https://gtdb.ecogenomic.org/
GNU General Public License v3.0
9 stars 2 forks source link

Propagate taxonomic annotations to revised NCBI genomes #31

Closed donovan-h-parks closed 8 years ago

donovan-h-parks commented 8 years ago

NCBI occasionally revises genomes. This results in an incremental update to their assembly accession number (e.g., GCA_1234567.1 to GCA_1234567.2). In such cases, it would be very useful if taxonomic annotations were automatically assigned to the revised genome (i.e., from GCA_1234567.1 to GCA_1234567.2). This should reduce the amount of work required to update the taxonomy with each new pull from NCBI.

donovan-h-parks commented 8 years ago

Pierre, can you confirm this is now in place?

pchaumeil commented 8 years ago

The versioning update is in the update_database script.

However the script needs to be modified to delete all aligned markers from the previous version.

pchaumeil commented 8 years ago

Fixed in : a7f9127b34385c1428c9307a8b68f7ec16e52dfe