CIAT-DAPA / UsefulPlants-Indicator

Project Management of the Aich13 indicators
Mozilla Public License 2.0
4 stars 2 forks source link

Investigate ways to reduce dischord between taxonomies in different databases #144

Open ColinKhoury opened 6 years ago

ColinKhoury commented 6 years ago

For each database we normalized species names against the GBIF Taxonomic Backbone, using the GBIF Species Lookup Tool (GBIF 2018c) and the GBIF Species API v1 (GBIF 2018b). A 96.0% match was found between GBIF Taxonomic Backbone and WEP, and a 93.3% match was found between the Backbone and the GENESYS and the Global Crop Wild Relative Occurrence Database.

We should explore ways to fix taxonomy so that matches are as close to 100% as possible. TNRS?

danipilze commented 6 years ago

Sure, but please take in count we'll never reach the 100% since not everything in the taxonomy field can be verifiable as a valid taxonomy or even a word that looks like a taxonomy.

Here there are some highlight of the unmatched taxonomy:

image

image

Also, in the unmatched count I'm putting the matches that wasn't on species level (or lower), so the records document only to genus (or higher) level are counted there.