plazi / community

This repo is intended to serve as a help desk for TreatmentBank-users.
6 stars 1 forks source link

explosion of recordedBy from treatment published April 2021 #70

Open dshorthouse opened 3 years ago

dshorthouse commented 3 years ago

Parsing of content for eventDate and recordedBy in this very recent treatment made an explosion.

Context GBIF occurrence Plazi reference

mguidoti commented 3 years ago

Thanks for reporting this!

We started to process material citations from Pensoft journals, and, we might get some issues until we are fully adapted. But we're already looking into it, and will fix it in a timely manner.

dshorthouse commented 3 years ago

Sorry, was inadvertently clicking buttons.

mguidoti commented 3 years ago

No worries.

So, we have several different rules blocking the data transit to GBIF, regarding material citations alone as you can see in the print below:

image

This is from the QC dialog in GGI and these rules were added by Oct-Nov, last year. The novelty here is the fact that this is a Pensoft import, meaning, it's coming from a XML and not a processed PDF. As said, we are just starting with the processing of matCits from these Pensoft imports and I've a feeling that it haven't passed through the "gatekeeper" like it's suppose to. If so, the transit would have been blocked and we would have to manually curate the paper before letting it go to GBIF - which is our normal workflow now.

So we are checking it and not only fixing it.

As soon as possible we'll get back to you here.