ImperialCollegeLondon / safedata_validator

Python tools to validate and publish datasets using the safedata metadata format.
https://safedata-validator.readthedocs.io/
MIT License
2 stars 4 forks source link

Dataset description does not handle GBIF and NCBI taxa correctly. #39

Closed davidorme closed 1 year ago

davidorme commented 1 year ago

Recent commits aligned the dictionary keys used in the GBIF and NCBI taxon indexes to make it easier to insert those values into the safedata_server taxa table for metadata searching. That broke taxa.ncbi_index_to_text and taxa.taxon_index_to_text and hence zenodo.dataset_description.

At the same time those changes make it easy to merge taxa.ncbi_index_to_text and taxa.taxon_index_to_text (which was really taxa.gbif_index_to_text) - there is a huge overlap in these functions and only the _taxon_format internal function differs.