CatalogueOfLife / testing

Editorial tests and discussion to prepare for COL releases

ReptileDB (id 1008): test report #129

Open yroskov opened 3 years ago

yroskov commented 3 years ago

@gdower used database dump 2020_12_v2 + classification from http://www.reptile-database.org/db-info/taxa.html

First version on DEV: https://data.dev.catalogueoflife.org/dataset/2144/classification

yroskov commented 3 years ago

CLASSIFICATION

image

Children taxa from the website:

Acontinae: children Acontias, Typhlosaurus https://reptile-database.reptarium.cz/advanced_search?taxon=Acontinae&submit=Search - these 2 genera are in subfamily NotAssigned in DEV

Ateuchosauridae: 1 child Ateuchosaurus (2 spp) https://reptile-database.reptarium.cz/advanced_search?taxon=Ateuchosauridae&submit=Search - the genus & 2 spp are in family Scincinae in DEV

Ristellidae: children Lankascincus, Ristella https://reptile-database.reptarium.cz/advanced_search?taxon=Ristellidae&submit=Search - these 2 genera are in family Sphenomorphinae in DEV

gdower commented 3 years ago

Fixed those issues; they were caused by atypical formatting in the classification field.

yroskov commented 3 years ago

Website: image

yroskov commented 3 years ago

image

Website says 1 sp., Calabaria reinhardtii: https://reptile-database.reptarium.cz/species?genus=Calabaria&species=reinhardtii&search_param=%28%28taxon%3D%27Calabariinae%27%29%29

The species is in subfamily NotAssigned in DEV: https://data.dev.catalogueoflife.org/dataset/2144/classification?taxonKey=Squamata-Booidea-Boidae-Calabaria%20reinhardtii%20%28Schlegel%2C%201851%29

yroskov commented 3 years ago

DEV: image

Website:
order Crocodylia - suborder Eusuchia
Family Alligatoridae
Subfamily Alligatorinae
Subfamily Caimaninae

image

CoL solution: ignore subfamilies in family Alligatoridae via clearinghouse

yroskov commented 3 years ago

TESTS OF FINAL VERSION (2020-12) on prod: https://data.catalogueoflife.org/dataset/1008/classification

yroskov commented 3 years ago

ISSUES

yroskov commented 3 years ago

TASKS

image

Resolved 2021-05-24

image

yroskov commented 3 years ago

ReptileDB ver 2020-12; received by CoL 2021-05-03, synced 2021-05-24 (with subspecies as synonyms). Published in CoL 2021-06-10.

yroskov commented 3 years ago

ReptileDB ver 2021-05; received by CoL 2021-05-23. Imported on prod 2021-06-23 (with subspecies accepted).

ISSUES (assessed)

yroskov commented 3 years ago

TASKS 2021-06-23

image

Resolved 2021-06-24

image

Synced 2021-06-24

yroskov commented 3 years ago

ReptileDB in ac21: 6,681/0 spp (vs 11,440/0 on June 18). Loss of 4,759 spp in CoL [during sync?].

@gdower hypothesis: loss caused by the decisions "block species".

I do not see a reason for blocking those accepted species. There is no tool to delete a block of filtered decisions at https://data.catalogueoflife.org/catalogue/3/decision?limit=100&offset=0&subjectDatasetKey=1008.

@gdower deleted the decisions programmatically on 2021-08-20.

yroskov commented 3 years ago

TASKS after deletion of decisions reptileDB + blocked + species: image

Resolved: image

Synced 2021-08-20.

yroskov commented 2 years ago

New version 2021-11. Dump received 2021-11-15.

The importer failed for ReptileDB, getting stuck on something with distribution gazetteers: https://github.com/CatalogueOfLife/backend/issues/1064 = FIXED

Re-imported on DEV with the distribution data removed: https://data.dev.catalogueoflife.org/dataset/2144/classification

Imported to prod 2021-11-16

yroskov commented 2 years ago

Can we do something about this case? (So far, I have left the names without a decision.)

Another example: https://data.catalogueoflife.org/catalogue/3/dataset/1008/workbench?facet=rank&facet=issue&facet=status&facet=nomStatus&facet=nameType&facet=field&facet=authorship&facet=authorshipYear&facet=extinct&facet=environment&facet=origin&limit=50&offset=0&q=Liolaemus%20bellii image

@gdower, I guess the crawling/parsing needs our further attention.

yroskov commented 2 years ago

TASKS 2021-11-16 image

Resolved: image

Synced 2021-11-16 (with remaining issues)

yroskov commented 2 years ago

Markus: some vernacular names have the language given in front of the actual name, e.g. (E) Freckled Monitor or (G) Trauerwaran (https://github.com/CatalogueOfLife/data/issues/352#event-5625694715)

https://data.catalogueoflife.org/dataset/1008/taxon/Reptilia-Squamata-Platynota-Varanidae-Varanus%20tristis-41ecd4c74f9a68152e708c6f4640ae6f

@gdower, can your convertor script resolve this?
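A minimal sketch of how such prefixes could be split off during conversion. It is not taken from the actual converter script: the function name `split_vernacular` is hypothetical, and the mapping of `E` and `G` to English and German is an assumption based only on the two examples quoted above.

```python
import re

# Assumed mapping from the single-letter prefix to an ISO 639-1 code.
# Only "E" and "G" appear in the examples; their meaning is a guess.
PREFIX_TO_LANG = {"E": "en", "G": "de"}

# A leading "(X)" marker followed by the actual name.
_PREFIX = re.compile(r"^\s*\(([A-Za-z])\)\s*(.+?)\s*$")


def split_vernacular(raw):
    """Return (name, language_code_or_None) for one raw vernacular string."""
    match = _PREFIX.match(raw)
    if not match:
        return raw.strip(), None
    code = PREFIX_TO_LANG.get(match.group(1).upper())
    if code is None:
        # Unknown prefix: keep the string as it is so nothing is lost.
        return raw.strip(), None
    return match.group(2), code
```

Strings without a recognised prefix are returned unchanged, so the step is safe to run over all vernacular names.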

yroskov commented 2 years ago

Spotted problems (preview 2022-10-12):

subfamily: Elapinae image

subfamily: Leiolepidinae: image

yroskov commented 2 years ago

Attention: two separate genera Macrelaps and Micrelaps in the family Atractaspididae.

Peter, 2022-01-14: Yes, that’s correct (and potentially confusing): two genera (within the same family) that are not even closely related.

yroskov commented 2 years ago

Revisiting "decisions" after fixes https://github.com/CatalogueOfLife/testing/issues/184#issuecomment-1055567444

TASKS 2022-03-04 image

Resolved 2022-03-04: image

Synced 2022-03-04

yroskov commented 7 months ago

Due to changes in Vertebrata classification (https://github.com/CatalogueOfLife/testing/issues/186#issuecomment-1828674947): Synonym "Reptilia" should be added with 4 orders Rhynchocephalia, Squamata, Crocodylia & Testudines.

Implemented 2024-02-05: these four orders established as new sectors from the resource id 279229, synced; ReptileDB children taxa re-established as sectors under orders of resource id 279229.

ReptileDB re-synced 2024-02-05

AS A RESULT of implementing the classification from the resource id 279229 "COL Checklist Chordata Higher Classification, ver 1.0 / 2023-11-27", rank Class is missing now for all reptiles.

yroskov commented 7 months ago

Cleaning up outdated decisions... 2024-02-07: 1,351 outdated decisions left in CoL, mainly in ReptileDB. The same subspecies is both an accepted name and a synonym of its parent species.

Need: (1) delete all outdated decisions [x] & (2) re-do all tasks
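A rough sketch of how names that are both accepted and synonym could be listed from an export before re-doing the tasks. The record layout (dicts with `name` and `status` keys) and the function name are hypothetical and do not reflect the actual CoL data model.

```python
from collections import defaultdict


def accepted_and_synonym(records):
    """Return the sorted names that occur with both statuses.

    `records` is an iterable of dicts with the hypothetical keys
    "name" (full scientific name) and "status" ("accepted" or "synonym").
    """
    statuses = defaultdict(set)
    for rec in records:
        statuses[rec["name"]].add(rec["status"])
    return sorted(
        name for name, seen in statuses.items()
        if {"accepted", "synonym"} <= seen
    )
```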

image

Resolved 2024-02-07:

image

Re-synced 2024-02-07

yroskov commented 5 months ago

ReptileDB ver 2023-09 / 2023-09-30; dump received from Peter 2024-03-21, plus classification harvested from http://www.reptile-database.org/db-info/taxa.html; imported 2024-03-21

Tested on DEV https://github.com/CatalogueOfLife/testing/issues/134#issuecomment-2013245082

Metrics

image

ISSUES assessed 2024-03-22

image

TASKS

image

yroskov commented 5 months ago

Have a look at:

Ablepharus (Morethia) anomalus (Adelaidensis) Peters, 1874

Ablepharus seydeli DE Witte, 1933

Agama (Eremioplanis) lessonae DE Filippi, 1865

Amphiesma andreae Ziegler & LE Khac Quyet, 2006

Amphiesma metusia Inger, Zhao, Shaffer & WU, 1990

Anolis boulengeri O'SHAUGHNESSY, 1881

Acontias (Evesia) smithi Deraniyagala, 1934
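Several of the names above carry all-caps tokens in the authorship (DE, LE, WU, O'SHAUGHNESSY). A minimal sketch of a check that flags such tokens, assuming the full name string is available as plain text; the function name is hypothetical, and the check does not cover the other oddities in this list, such as the capitalised epithet in parentheses.

```python
import re

# A word made of letters, optionally joined by an apostrophe or hyphen.
_WORD = re.compile(r"[^\W\d_]+(?:['’\-][^\W\d_]+)*")


def uppercase_tokens(name_string):
    """Return tokens written entirely in capitals (two letters or more).

    Single-letter tokens are ignored so that initials are not flagged.
    """
    return [
        token for token in _WORD.findall(name_string)
        if token.isupper() and sum(ch.isalpha() for ch in token) >= 2
    ]
```

For example, `uppercase_tokens("Ablepharus seydeli DE Witte, 1933")` returns `["DE"]`, while a name with normally capitalised authors returns an empty list.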

yroskov commented 5 months ago

There are species names with very long author strings, for example:

Bothriechis khwargi Arteaga, Pyron, Batista, Vieira, Pelayo, Smith, Barrio-Amorós, Koch, Agne, Valencia, Bustamante, Harris & Guayasamin, 2024 (https://zoobank.org/NomenclaturalActs/8BFF5643-306F-435A-B074-668B41C43291)

Bothriechis rahimi Arteaga, Pyron, Batista, Vieira, Pelayo, Smith, Barrio-Amorós, Koch, Agne, Valencia, Bustamante, Harris & Guayasamin, 2024

etc.

yroskov commented 5 months ago

ReptileDB ver 2024-03 / 2024-03-28; dump received from Peter 2024-03-30, plus classification harvested from http://www.reptile-database.org/db-info/taxa.html; imported 2024-04-01

Metrics

image

ISSUES assessed 2024-04-01

image

TASKS

image

Redundant subspecies synonymy with parent species (unresolved):

ACC-SYN infraspecies (different accepted, different authors): 2 of 218

ACC-SYN infraspecies (different accepted, same authors): 1 of 1860

Resolved 2024-04-01:

image

Synced 2024-04-01

yroskov commented 5 months ago

ReptileDB ver 2024-03 / 2024-03-28; new iteration imported 2024-04-02

Classification:

TASKS resolved 2024-04-03:

image

Synced 2024-04-03

yroskov commented 3 months ago

ReptileDB ver 2024-03 / 2024-03-28; new iteration imported 2024-06-07

TASKS

image

Resolved 2024-06-07:

image

Synced 2024-06-07