CatalogueOfLife / data

Repository for COL content
7 stars 2 forks source link

>24.500 names in IUCN redlist not covered by COL #529

Open mdoering opened 1 year ago

mdoering commented 1 year ago

The IUCN Redlist contains ~24.500 names not covered by the COL Checklist, maybe more. The GBIF Backbone adds these names to the COL Checklist and offers a search and facet counts for these to get an overview:

image

There are even 3400 accepted species missing:

image

As the redlist is an important list of species it would be good to investigate what the cause of this is in COL. I would not have expected that many missing names. Given that the redlist contains 250.000 names this is roughly 10% missing in COL.

mdoering commented 1 year ago

Looking at a random accepted chordate species with some GBIF occurrences, these are mostly reptiles, but also fish and even birds:

https://www.gbif.org/species/10698304 https://www.gbif.org/species/2444434 https://www.gbif.org/species/2451139 https://www.gbif.org/species/2419793 https://www.gbif.org/species/4848207

Looking closely most of these do actually come from COL. It seems rather to be a problem with the species search in GBIF!

mdoering commented 1 year ago

For example the species Uma rufopunctata is listed as coming from IUCN: https://www.gbif.org/species/search?q=Uma&rank=SPECIES&dataset_key=d7dddbf4-2cf0-4f39-9b2a-bb099caae36c&constituent_key=19491596-35ae-4a91-9a98-85cf505f1bd3&highertaxon_key=44&origin=SOURCE&status=ACCEPTED&advanced=1

But on the species page it says COL ReptileDB: https://www.gbif.org/species/2451139