tdwg / dwc-qa

Public question and answer site for discussions about Darwin Core
Apache License 2.0
49 stars 8 forks source link

serving datasetID for some, not all records #100

Open eclites opened 7 years ago

eclites commented 7 years ago

From https://github.com/tdwg/dwc-qa/wiki/Institutions-and-Collections, it seems that datasetID would be an appropriate way for a TCN to credit relevant data. Will iDigBio/GBIF serve datasetID if it is provided? Will it be a problem if datasetID is blank for some records?

dagendresen commented 7 years ago

The term "datasetID" in the occurrence-core is not mandatory in GBIF (and I believe also not indexed by GBIF). http://rs.gbif.org/core/dwc_occurrence_2015-07-02.xml#datasetID The concept "datasetKey" in GBIF identifies a resource such as an entire Darwin Core Archive. And cannot be declared per record.

eclites commented 7 years ago

Okay, thanks!

ekrimmel commented 6 years ago

This issue more broadly relates to the question of how best to track and aggregate occurrence records related to a funded project, e.g. a TCN, which is something that was discussed by some of the paleo community (@eclites @hollyel @tkarim) at the Digital Data meeting in Berkeley last month.

Opinion from @kevinlove was that dwc:datasetName (or dwc:datasetID) is not the right place. He suggested possibly a new field, which is what oVert is doing. Another option is to use dynamic properties.

dwc:datasetName is indexed by iDigBio, e.g. http://search.idigbio.org/v2/summary/top/records?top_fields=[%22indexData.dwc:datasetName%22]&count=100

This may not be a DwC Hour topic, but is probably something useful to document here if/when we reach any consensus.

hollyel commented 6 years ago

related to @ekrimmel comment see #37

debpaul commented 6 years ago

To @eclites the issue is really that while a TCN exists, data (so far) from each TCN data partner now comes to us separately. We need a way to credit and refer to both the TCN as an entity, but also the institution providing some data for that TCN as a TCN member partner. Another field (that needs adding?) would be one that allows one-to-many for referencing grant numbers relating to a given specimen (and/or dataset). NSF would be very happy to see such a thing so that metrics could be done to show what's been amassed with NSF funding.

tucotuco commented 6 years ago

Would NSF be willing to pay for the effort to make that happen?

debpaul commented 6 years ago

that is a very interesting question that we should ask!

On 2018-07-02 4:28 PM, John Wieczorek wrote:

Would NSF be willing to pay for the effort to make that happen?

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_tdwg_dwc-2Dqa_issues_100-23issuecomment-2D401924257&d=DwMCaQ&c=HPMtquzZjKY31rtkyGRFnQ&r=ODXYRdWm1Oqf5-w5G2NjQw&m=B-Q6Xas6pVTq6VwnzWEM7txrZ2YslKrS7CjRdglHIl0&s=8byGbTkQNnvWuUDZ0pT2cbpCh3h9Nk3EdRfkVUdQpzs&e=, or mute the thread https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_notifications_unsubscribe-2Dauth_AC2gS57g-2DzHEaIwMv3O8-5FmqnuiaOi-5FPUks5uCoJUgaJpZM4PbT1D&d=DwMCaQ&c=HPMtquzZjKY31rtkyGRFnQ&r=ODXYRdWm1Oqf5-w5G2NjQw&m=B-Q6Xas6pVTq6VwnzWEM7txrZ2YslKrS7CjRdglHIl0&s=wyN8KvRtjA8CbfgewBWFG-8J_nCJn4RlS37WXkLUdlQ&e=.

-- -- Upcoming iDigBio Events https://www.idigbio.org/calendar -- Deborah Paul, iDigBio Digitization and Workforce Training Specialist iDigBio -- Steering Committee Member SPNHC Liaison, Member-At-Large and Member International Relations Committee SYNTHESYS3 Representative Institute for Digital Information, 234 LSB Florida State University Tallahassee, Florida 32306 850-644-6366