Open mdoering opened 6 months ago
@DianRHR @camiplata maybe you can go through some of these names and place them into the correct parser file?
We should consider to also process all verbatim gbif classifications and see which ones create no group at all
An important part of the name usage matching, apart from plain name matching, is to compare the classification of matched candidates to disambiguate homonyms. As classifications can be very different in some parts or exist only patchy the algorithm rather tries to match each higher taxon to a limited, hand selected set of hierarchical taxonomic groups to keep the major groups apart, e.g plants to animals. For each of the groups we maintain a text file listing higher names down to families that unambiguously indicate such a group. For example
Asteraceae
clearly point to Angiosperms.ChecklistBank has a tool to analyse all higher names and report those that currently are not listed in any of the files. Go through at least the names down to class, better order, and add them to the respective parser files.
File with higher CLB names not_mapped.tsv.gz to any tax group