Closed lomky closed 6 years ago
identifier - fine as UUID
url - the full url, not including any parameters. Parameters should be collected on either the activity or references as appropriate.
title - whatever the Reference names the URL, otherwise as stated on the webpage itself, correct as of the creation of the object.
access_date - the access date for this webpage. Should not be used, as we have unique URL requirement. A access date should go on either the Reference object (e.g. if this is a referenced publication) or an Activity object (e.g. if this cited on a Figure)
Webpages are valid to be citedBy
. They do not use cites
, as they are not USGCRP products.
Contributors: webpages often have a host
, and may have authors
, as appropriate.
Files - not used on Webpages.
gcmd_keywords & regions not yet implemented.
Nice improvements:
It would be nice to be able to mark a website as known to be dead or defunct.
Edge Case Conventions:
This all looks good to me. Can be accepted as is, though it would be good add something from @amruelama about making sure we don't cite a dataset landing page as a webpage and how to make that distinction.
Here are few examples of webpages that look like datasets (data in this case):
Will list out the distinctions based on different examples soon.
I put the content into the document. Leaving this open until we have the language on webpage vs dataset.
@amruelama any progress on webpage
vs dataset
distinctions?
This will require a manual QA to determine if a dataset is categorized as a webpage in GCIS. We have done this process previously in issue #329. Basically, we need to use its reference as a source and determine if they have mentioned the use of the 'dataset' to derive a table of a figure. The easiest way to filter this is searching the keyword 'data' or 'dataset' in its URL. Also, a thorough QA is needed in order to complete this process. After the QA process, we should convert the webpage to dataset (#506) To avoid this misinterpretation in the future, this QA could be done before the release when syncing the reference as type 'web page' and while adding new child publications as type 'webpage' through script.
A ticket to discuss the conventions surrounding Webpage.
Current Webpage Conventions.
Webpage Fields:
Provenance Connections:
Relationships: