ImperialCollegeLondon / safedata_validator

Python tools to validate and publish datasets using the safedata metadata format.
https://safedata-validator.readthedocs.io/
MIT License

Handling of Incertae sedis output from genbank #172

Open jacobcook1995 opened 1 month ago

jacobcook1995 commented 1 month ago

GenBank-style taxonomy output represents taxa of uncertain placement in the following format: f_Sordariales_fam_Incertae_sedis. At the moment this fails, because searching the NCBI database for e.g. Sordariales_fam_Incertae_sedis doesn't yield any sensible results.

In reality, ranks like this should be ignored, as no specific taxonomic claim is being made. The exception is when such an entry is the lowest-level rank populated, in which case it should not be ignored.
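
To make the proposed rule concrete, here is a minimal Python sketch. It is not safedata_validator code: the function names, the list-of-strings input, and the assumption that every entry looks like "<rank letter>_<name>" (as in the example above) are all hypothetical. It drops Incertae sedis entries from a lineage unless the entry is the lowest populated rank, and it reports that case with a flag so the caller can decide how to handle it.

```python
import re

# Assumption: uncertain placements always contain the text "Incertae_sedis",
# as in the example "f_Sordariales_fam_Incertae_sedis".
INCERTAE_SEDIS = re.compile(r"incertae_sedis", re.IGNORECASE)


def is_incertae_sedis(entry: str) -> bool:
    """Return True if a rank entry marks a taxon of uncertain placement."""
    return bool(INCERTAE_SEDIS.search(entry))


def filter_incertae_sedis(lineage: list[str]) -> tuple[list[str], bool]:
    """Drop Incertae sedis entries except at the lowest populated rank.

    `lineage` is ordered from highest to lowest rank. Entries with nothing
    after the rank prefix (e.g. "g_") are treated as unpopulated.

    Returns the entries worth looking up, plus a flag that is True when the
    lowest populated rank is itself an Incertae sedis entry.
    """
    # Keep only entries that have a name after the first underscore.
    populated = [e for e in lineage if e.partition("_")[2]]
    if not populated:
        return [], False

    lowest = populated[-1]
    # Higher ranks of uncertain placement make no specific claim: skip them.
    kept = [e for e in populated[:-1] if not is_incertae_sedis(e)]
    kept.append(lowest)
    return kept, is_incertae_sedis(lowest)


# Hypothetical lineage: "g_Examplegenus" is a placeholder, not a real taxon.
lineage = [
    "k_Fungi",
    "o_Sordariales",
    "f_Sordariales_fam_Incertae_sedis",
    "g_Examplegenus",
]
kept, lowest_is_uncertain = filter_incertae_sedis(lineage)
```

Returning a flag rather than silently dropping or keeping the lowest entry leaves that decision to the caller, since the right handling for that case is not settled here.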