HumanCellAtlas / dcp-community

HCA Data Coordination Platform community content
5 stars 18 forks source link

Data Citation Plan #103

Closed theathorn closed 5 years ago

theathorn commented 5 years ago

This RFC outlines a 3-phase plan for providing Data Citation support for the HCA DCP experimental data and associated metadata.

Status: Oversight Review Last call for oversight review: 4 Oct 2019

Summary of Review Discussion for Approvers There has been general acceptance of the 3 proposed implementation phases with Phase 1 consisting of a stable project URL. Objections to using an external DOI registration agency (such as Zenodo) have been raised. This matter has been referred to UX for further research, following which a recommendation to the Oversight Committee will be made. There was a lengthy discussion on the possible use of Compact Identifiers within the HCA metadata but there is no decision to adopt these at this time.

mckinsel commented 5 years ago

Should we do anything for users who don't get their data via the Data Browser?

mweiden commented 5 years ago

@mckinsel @theathorn During Phase 1, I guess the assumption is that users will understand how to link the project uuid to the project uuid in the Matrix Service?

theathorn commented 5 years ago

I guess the assumption is that users will understand how to link the project uuid to the project uuid in the Matrix Service?

The citation link (which the Data Browser would supply to the user and may well encode the project_uuid) would get the user to Data Browser project detail page where they could click the "mtx" icon to download the project's matrix.

theathorn commented 5 years ago

Should we do anything for users who don't get their data via the Data Browser?

I think you mean programmatic access via the Matrix Service and/or Query Service? That's currently out-of-scope for this RFC but I'm open to suggestions from Tech Arch as to how this might be achieved in the future.

diekhans commented 5 years ago

I have absolutely no problem with phase 1 as long as we are making it clear we are not citing specific sets of data.

lauraclarke commented 5 years ago

Should we do anything for users who don't get their data via the Data Browser?

I think you mean programmatic access via the Matrix Service and/or Query Service? That's currently out-of-scope for this RFC but I'm open to suggestions from Tech Arch as to how this might be achieved in the future.

@theathorn Maybe a feature request for the HCA cli would be a get citation which would return the same text as the data portal widget does, at least make it easy for programmatic users to get the info without having to browse the website and find what they are looking for.