brshipley / megaSDM

Other
22 stars 3 forks source link

Citation of GBIF occurrences #1

Closed dnoesgaard closed 2 years ago

dnoesgaard commented 2 years ago

Kudos on the R package and ecography paper!

I wanted to reach out to raise an issue around the potential use of GBIF occurrences via rgbif::occ_search(). As you may know, GBIF-mediated data is open and free for all to use, but comes with the requirement that users publicly acknowledge the data publishers whose data is used. For third-party applications, the best approach to enable users to live up to this requirement is simply to ensure that the datasetKey column is returned and retained in all queries and applications. With this data, users can create a derived dataset record in GBIF to have a unique identifier representing the specific data they obtained and used. More here: https://www.gbif.org/citation-guidelines#thirdParty

I hope you would agree that crediting data publishers is important in this and I'd love to hear your thoughts on this.

brshipley commented 2 years ago

Thank you!

I completely agree that crediting the publishers of the data is vital, and I've added "datasetKey" as a column in our OccurrenceCollection function. This should allow for proper citation of the data, in addition to generation of derived datasets. I believe we originally return that column in early versions of the code, but somehow it was edited out. Thanks for bringing this to my attention!

dnoesgaard commented 2 years ago

Thanks the prompt response and action! Much appreciated!