gbif-norway / helpdesk

Please submit your helpdesk request here (or send an email to helpdesk@gbif.no). We will also use this repo for documentation of node helpdesk cases.
GNU General Public License v3.0
3 stars 0 forks source link

"duplicate" records published to GBIF - query from Leif Aarvik #61

Open rukayaj opened 2 years ago

rukayaj commented 2 years ago

Leif's original query was that some species records are appearing on Artskart which should not be appearing as they are not 'wild' occurrences. I have sent Artsdatabanken an email about this asking what their policy is with publishing such records.

An example of a record published from Corema: https://www.gbif.org/occurrence/2571193179. Corema export contains establishmentMeans = introduced, degreeOfEstablishment = casual and pathway = transportContaminant. Note that currently we are not actually publishing degreeOfEstablishment or pathway yet, as GBIF still need to add these terms to the IPT mapping (https://github.com/gbif/ipt/issues/1532).

Leif was also concerned that Artskart seem to be publishing another (i.e. "duplicate") record you can see this here. This second record is this one https://www.gbif.org/occurrence/3349442605 in GBIF, from ENA's INSDC Sequences dataset. I think it has probably come from the BOLD record that has been published. This is all fine, but I see that the BOLD record and therefore the INSDC Sequences record does not have the establishmentMeans field, which it ideally should.

So, we need to verify that it is the BOLD record is somehow feeding through to this INSDC Sequences dataset, and work out how to make sure the establishmentMeans field (and degreeOfEstablishment and pathway) get fed through as well.