genomehubs / goat-data

MIT License
2 stars 0 forks source link

Unable to find taxa #22

Closed ccaio closed 1 year ago

ccaio commented 1 year ago

GoaT is unable to pickup some of the DToL targets, despite having taxon IDs available. We need to understand why they are not in GoaT?

Examples of Taxon IDs (tardigrades from last week): 2883163, 2883156, 2883157, 2883158

More examples (Algae from this week): 2949304 2949301 2949299 Cymbomonas sp. PLY 262 Rhodomonas sp. PLY 904 Chlamydomonas sp. PLY 498

Most of them on ncbi, with a valid taxon ID: https://www.ncbi.nlm.nih.gov/taxonomy/?term=2949301

rjchallis commented 1 year ago

at least one of these taxids (2883163 - Echiniscus sp. ZA.555) is missing from the json file. An adjacent related taxid with a similar name (2883162 - Echiniscus sp. ZA.431) is present in the jsonl and does import. It would be useful for debugging to test if including 2883163 in the json makes it importable and if excluding 2883162 prevents it importing.

sujaikumar commented 1 year ago

Problems found, running test. Should be fixed by tomorrow's release :-). Will close issue after test succeeds

sujaikumar commented 1 year ago

Fixed now :-)