cf-convention / cf-conventions

AsciiDoc Source
http://cfconventions.org/cf-conventions/cf-conventions
Creative Commons Zero v1.0 Universal

Add attribute citation_id #160

Closed castelao closed 1 year ago

castelao commented 5 years ago

THIS IS OUTDATED. I'm editing this proposal to reflect the discussions so far, but I'll save a copy of this original proposal.

Title: DOI attribute

Moderator: to be defined

Requirement Summary: Optional DOI attribute in section Description of file contents (2.6.2).

Technical Proposal Summary: Add a new optional attribute to designate the Digital Object Identifier (DOI) of the data contained in the CF data object.

Benefits: DOIs allow the scientific impact of data to be tracked automatically, in exactly the same way that scientific publications are tracked with DOIs. Anyone involved in producing the data can be recognized, including funding agencies.

Status Quo: An increasing number of scientific journals now require a DOI for the dataset used in a publication. Many groups already include a DOI as an attribute in their NetCDF-CF datasets, but without a standard, which makes it hard to automate.

Detailed Proposal: The only modification required would be in section 2.6.2: Description of file contents. At the bottom, after the item comment, the following would be added:

doi: Digital object identifier (DOI) of the dataset. For simplicity, the proxy part
       of the DOI is dropped, so it is composed of the prefix plus the suffix only,
       e.g. “10.21238/S8SPRAY1618”.

As mentioned in the 2.6.2 section, all attributes are optional, and the doi would follow the same rule.
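
To illustrate the "proxy part is dropped" rule, here is a minimal Python sketch of how a writer or checker might reduce a DOI to the bare prefix/suffix form. The helper name `normalize_doi`, the list of accepted proxy forms, and the `10.NNNN/...` shape check are assumptions made for illustration; none of them is part of the proposal or of CF.

```python
import re

# Hypothetical helper, not part of CF: reduce a DOI written in a common
# form to the bare "prefix/suffix" form described in the proposal.
# Assumed proxy forms: http(s)://doi.org/, http(s)://dx.doi.org/, and "doi:".
_PROXY = re.compile(r"^(?:https?://(?:dx\.)?doi\.org/|doi:)", re.IGNORECASE)

# Heuristic shape check (an assumption): "10.", 4-9 digits, "/", a suffix.
_BARE_DOI = re.compile(r"^10\.\d{4,9}/\S+$")


def normalize_doi(value: str) -> str:
    """Strip a resolver URL or 'doi:' label and return 'prefix/suffix'."""
    bare = _PROXY.sub("", value.strip())
    if not _BARE_DOI.match(bare):
        raise ValueError(f"not a recognizable DOI: {value!r}")
    return bare


print(normalize_doi("https://doi.org/10.21238/S8SPRAY1618"))
```

Storing only the bare form keeps the attribute independent of whichever resolver is in use; a reader can rebuild a link by prepending a resolver address.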

This proposal was developed with the help of @kenkehoe.

Reasoning:

DOI is a de facto standard for tracking academic publications, and so provides the foundation for some measures of scientific impact. There is a clear intention in the scientific community to also track the scientific impact of data and software, giving proper credit to those who make them available. The strategy adopted by AMS journals, and more recently by AGU, is to require that the DOI of any dataset used in a publication be cited in the reference list (https://www.ametsoc.org/ams/index.cfm/publications/authors/journal-and-bams-authors/formatting-and-manuscript-components/references/dataset-references/). The use of DOIs for datasets will increase. A few groups already include the dataset DOI in their NetCDF-CF data files, but without a standard it is hard for a machine to keep track of it.

Justification:

Tiny background on DOI:

Example

// global attributes:
    :Conventions = "CF-1.7, ACDD-1.3" ;
    :title = "California Underwater Glider Network" ;
    :featureType = "trajectoryProfile" ;
    :id = "CUGN_90" ;
    :standard_name_vocabulary = "CF Standard Name Table v62" ;
    :doi = "10.21238/S8SPRAY1618" ;
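
As a rough illustration of the automation a standard attribute name would allow, the sketch below pulls string-valued global attributes out of CDL header text (the format printed by `ncdump -h`) and looks up `doi`. The function name `global_attributes` and the regular expression are assumptions for this example; a real tool would more likely read the attribute through a netCDF library, and this pattern does not handle escaped quotes or non-string attributes.

```python
import re

# Matches a CDL global-attribute line with a quoted string value, e.g.
#     :doi = "10.21238/S8SPRAY1618" ;
# Simplification (an assumption): no escaped quotes inside the value.
_GLOBAL_ATTR = re.compile(r'^\s*:(\w+)\s*=\s*"([^"]*)"\s*;', re.MULTILINE)


def global_attributes(cdl_header: str) -> dict:
    """Collect string-valued global attributes from CDL header text."""
    return dict(_GLOBAL_ATTR.findall(cdl_header))


header = '''
// global attributes:
    :Conventions = "CF-1.7, ACDD-1.3" ;
    :id = "CUGN_90" ;
    :doi = "10.21238/S8SPRAY1618" ;
'''

attrs = global_attributes(header)
print(attrs.get("doi"))  # None would mean the file carries no doi attribute
```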

castelao commented 1 year ago

@JonathanGregory, thanks for checking. I still think it is an important issue, but indeed it didn't reach a consensus. It is probably better to close this issue. I suggest just holding it for a few days, in case somebody else is interested in moving or saving these discussions to another convention or standard.

Thanks!

castelao commented 1 year ago

It's time to close this one. Thank you all for your contributions.

@justinbuck, @vturpin, @emmerbodc, @jenseva, some of the ideas and discussions here might be of interest to you.