Closed lomky closed 5 years ago
it's a bit of a tangled situation.
Issues to handle:
identifier
name
version
description
native_id
url
doi
release_dt
publication_year
description_attribution
These are the business of the dataset itself, GCIS has no stake in them.
type
data_qualifier
spatial_ref_sys
cite_metadata
scope
spatial_extent
temporal_extent
vertical_extent
processing_level
spatial_res
start_time
end_time
lat_min
lat_max
lon_min
lon_max
These fields have more to do with how a dataset was used than with the dataset itself or with the provenance metadata. Generally they should go on the Activity
in GCIS.
access_dt
scale
If they used a subset of a dataset, they may have:
spatial_extent
temporal_extent
vertical_extent
start_time
end_time
lat_min
lat_max
lon_min
lon_max
We either do not know what this field is for, or the fields are catch-alls that are not useful.
scale
attributes
variables
identifier
org_name
+ dataset_name
+ version
version
, .
become _
org_name
+ dataset_name
+ publication_year
name
version
description
native_id
url
doi
release_dt
publication_year
description_attribution
description
field. If same as URL, duplicate URL here. Made a couple of typo fixes, looks great to me.
Emergent GCIS dataset-related questions:
• In cases where a prospective dataset has many (more than 2) contributing organizations/dataset producers, no clear lead organization, and no citation documentation, how does the GCIS dataset name convention fair? For example, consider this dataset/data archive at https://gdo-dcp.ucllnl.org/downscaled_cmip_projections/dcpInterface.html#Links
• Besides being imported from “data.gov”, are there other reasons why we should/could consider using a dataset’s original identifier, especially when the pertinent dataset lacks a DOI?
• When should we use a parent organization name rather than a subsidiary name in our dataset name/identifier convention? For example, DOE as the prefix, rather than LLNL or EIA (the link in the first comment has LLNL as a data developer - the privacy and legal notice also links to LLNL's official website. Yet, the dataset website ends in ".org" and not ".gov").
The dataset identifier that the previous comments applied to seem fine. https://data-stage.globalchange.gov/dataset/ucllnl-downscaled-cmip3-cmip5-climate-hydrology-projections. No need for any revision.
I have reviewed recently created datasets, and the identifiers also seem fine. I will continue the review process indefinitely.
A ticket to discuss the conventions surrounding Dataset.
Current Dataset Conventions (Blank).
Dataset Fields:
Provenance Connections:
Relationships: