ncbi new_database failing unique constrain on `names`

fhcrc / taxtastic

Create and maintain phylogenetic "reference packages" of biological sequences.

GNU General Public License v3.0

21 stars 10 forks source link

ncbi new_database failing unique constrain on `names` #124

Closed dhoogest closed 5 years ago

dhoogest commented 5 years ago

Changes (ongoing) to NCBI taxonomy are causing the primary key relationship on names to be violated, resulting in a failure when executing taxit new_database. According to feedback from NCBI, this is the result of an ongoing 'upgrade' to the taxonomy system, and it does appear that records are being corrected incrementally (they also indicated that a "new version with more information may need to be adapted in the future")

Load of ncbi data from dump succeeds if the id primary column is restored, replacing the combined key of tax_id,tax_name, and name_class, however I'm not sure if there are downstream ramifications of this approach.