iobis / gbif-marine

potential duplicate datasets at GBIF #9

Open wardappeltans opened 8 years ago

wardappeltans commented 8 years ago

Can GBIF (@kcopas @dschigel) confirm that they can deal with potential duplicate datasets?

Note that when datasets from GBIF publishers are imported into OBIS node IPTs, the OBIS nodes will not register those datasets with GBIF again.

dschigel commented 8 years ago

If OBIS publishers are aware that particular datasets are already published through GBIF from elsewhere, there is of course no need to try to publish them again from OBIS nodes. If this does happen, however, deduplication at the dataset level works, provided that the publisher has not changed the record and/or dataset identifiers.
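
To illustrate why unchanged identifiers matter, here is a minimal sketch of identifier-based matching. It is not GBIF's actual deduplication logic; the function name, the `dataset_id` field, and the sample identifiers are all hypothetical and chosen only for this example.

```python
def find_duplicate_datasets(registered, incoming):
    """Return the incoming datasets whose identifier is already registered.

    Both arguments are lists of dicts with a hypothetical "dataset_id" key.
    """
    known_ids = {dataset["dataset_id"] for dataset in registered}
    return [dataset for dataset in incoming if dataset["dataset_id"] in known_ids]


# Hypothetical sample data: one dataset is already registered by another publisher.
registered = [{"dataset_id": "example-dataset-a", "source": "original publisher"}]

incoming = [
    # Same identifier as the registered copy, so it can be matched.
    {"dataset_id": "example-dataset-a", "source": "OBIS node"},
    # Same content republished under a changed identifier, so it cannot be matched.
    {"dataset_id": "example-dataset-a-renamed", "source": "OBIS node"},
]

duplicates = find_duplicate_datasets(registered, incoming)
for dataset in duplicates:
    print(dataset["dataset_id"], "is already registered")
```

In this sketch only the copy that kept its identifier is detected; the renamed copy passes through as if it were new, which is the failure case the comment above warns about.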

wardappeltans commented 8 years ago

Dear Dmitry, iOBIS only harvests from official OBIS nodes (tier-2 level), so we need all marine data to end up on an OBIS node IPT. However, we advise that datasets already published through GBIF from elsewhere are not registered with GBIF again by the OBIS nodes. In some cases, however, a marine publisher with GBIF may become a new OBIS node (abiding by the terms, standards and best practices of OBIS), so that duplication of datasets is not needed and iOBIS can harvest them directly.