Open jeanetteclark opened 5 years ago
What kind of publisher metadata can we automatically insert? Doesn't that need to be provided by the user? Or should we automatically set it to the Member Node name?
No, not provided by the user because the publisher is whatever member node they are publishing to.
It goes in eml/dataset/publisher as a responsibleParty, I imagine just with Arctic Data Center as the organization, and one of the above identifiers (although I cannot find what our organization identifier is in any of those systems for the life of me). @mbjones may be able to advise.
@laurenwalker ADC and other repositories may not be registered in those systems, but we need to get them there. I asked @gothub a few months ago to look into getting that rolling, and we ran into some timing issues with ROR. But GRID and Wikidata should be possible. Once they are in GRID, they should automatically end up in ROR.
our wikidata identifier is: Q77285095
I noted today that users can enter their own publisher information. It seems to me that they shouldn't be able to do this - the field is usually misinterpreted anyway. So as part of this issue I think we should consider removing that ability from the UI
Agreed, or at least make it a config option as to whether it shows up.
Here is a summary of the tasks required for this issue, as I understand it:
option 1: we add a new config option that is specifically used to provide the Publisher information, e.g.
/**
 * Information about the repository that will be automatically inserted into
 * new EML metadata documents as the Publisher. This object can set any of the
 * fields that are available in the Responsible Party EML type, see
 * {@link https://github.com/NCEAS/eml/blob/main/img/eml-party.png},
 * @type {object}
 */
publisher: {
  organizationName: 'Arctic Data Center',
  userId: 'Q77285095',
  onlineUrl: 'https://arcticdata.io'
}
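To make option 1 concrete, here is a minimal sketch of how such a config object might be turned into an eml/dataset/publisher element. The function names `escapeXml` and `publisherToEml` are hypothetical, and so is the `userIdDirectory` field: it is added here only because EML's userId element carries a `directory` attribute, and the value shown for it is an assumption, not something settled in this thread.

```javascript
// Hypothetical helper: escape the five XML special characters.
function escapeXml(value) {
  return String(value)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Hypothetical helper: serialize a publisher config object as an EML
// <publisher> element. Only the three fields from the example config are
// handled; a real responsible party can hold more.
function publisherToEml(publisher) {
  const parts = [];
  if (publisher.organizationName) {
    parts.push(
      `  <organizationName>${escapeXml(publisher.organizationName)}</organizationName>`
    );
  }
  if (publisher.onlineUrl) {
    parts.push(`  <onlineUrl>${escapeXml(publisher.onlineUrl)}</onlineUrl>`);
  }
  if (publisher.userId) {
    // Assumption: fall back to "unknown" when no directory is configured.
    const directory = publisher.userIdDirectory || "unknown";
    parts.push(
      `  <userId directory="${escapeXml(directory)}">${escapeXml(publisher.userId)}</userId>`
    );
  }
  return ["<publisher>", ...parts, "</publisher>"].join("\n");
}

const examplePublisher = {
  organizationName: "Arctic Data Center",
  userId: "Q77285095",
  userIdDirectory: "https://www.wikidata.org", // assumed value
  onlineUrl: "https://arcticdata.io",
};

console.log(publisherToEml(examplePublisher));
```

The real editor would presumably build this through its existing EML party model rather than by string concatenation; the sketch only shows which config field maps to which element.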
option 2: we could pull this information from other configuration options: organizationName = repositoryName, onlineUrl = baseUrl. We would just need to add a repositoryId for the userId, and an automaticallyFillPublisher (or similar) boolean option.
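For comparison, option 2 might look roughly like the sketch below. The option names repositoryName, baseUrl, repositoryId, and automaticallyFillPublisher come from the description above; the function name `resolvePublisher` and the choice to return `null` when auto-fill is off are assumptions made for illustration.

```javascript
// Hypothetical helper: derive the publisher from general repository config
// options instead of a dedicated `publisher` option.
function resolvePublisher(config) {
  // Assumption: anything other than an explicit `true` disables auto-fill.
  if (config.automaticallyFillPublisher !== true) {
    return null;
  }
  const publisher = {
    organizationName: config.repositoryName,
    onlineUrl: config.baseUrl,
  };
  // The identifier is optional until the repository is registered somewhere.
  if (config.repositoryId) {
    publisher.userId = config.repositoryId;
  }
  return publisher;
}

const exampleConfig = {
  repositoryName: "Arctic Data Center",
  baseUrl: "https://arcticdata.io",
  repositoryId: "Q77285095",
  automaticallyFillPublisher: true,
};

console.log(resolvePublisher(exampleConfig));
```

The trade-off is fewer config values to keep in sync, at the cost of not being able to set other responsible party fields without adding more options.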
Add a showPublisherInEditor config option. When the publisher option is empty (option 1) or automaticallyFillPublisher is false (option 2), check this showPublisherInEditor option to decide whether or not to display the "Publisher" role in the People section of the EML editor.
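The display rule just described could be sketched as follows. The function names are hypothetical. Hiding the role whenever the publisher is filled in automatically is an assumption: whether pre-filled publisher fields should instead be shown read-only is still an open question in this thread.

```javascript
// Hypothetical helper: true when the repository supplies the publisher itself,
// through either the `publisher` object (option 1) or the
// `automaticallyFillPublisher` flag (option 2).
function hasAutomaticPublisher(config) {
  const publisher = config.publisher;
  const hasOption1 = publisher != null && Object.keys(publisher).length > 0;
  const hasOption2 = config.automaticallyFillPublisher === true;
  return hasOption1 || hasOption2;
}

// Hypothetical helper: decide whether the "Publisher" role is offered in the
// People section of the editor.
function shouldShowPublisherRole(config) {
  if (hasAutomaticPublisher(config)) {
    // Assumption: an auto-filled publisher is kept out of the editor entirely.
    return false;
  }
  return config.showPublisherInEditor === true;
}

console.log(shouldShowPublisherRole({ publisher: {}, showPublisherInEditor: true }));
```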
@mbjones and @jeanetteclark, what do you think of this plan and do you have any feedback on these two questions? Thanks!
To answer your questions @robyngit:
1. Do we add this information:
a. to new EML documents only?
b. also to existing EML documents that have no publisher when they are edited?
2. When the Publisher information is pre-filled, should we display this in the editor but make the fields un-editable? Or just keep it hidden behind the scenes?
Answers: 1a & 1b. We add publisher information to all new EML documents, as well as to existing EML documents that did not have that info listed when they are edited. So if someone were to edit a published dataset, we're still checking that the publisher information is there when we curate the new changes. If it's not, we'll add it.
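A rough sketch of the "add it if it's missing" behaviour from answers 1a and 1b, using a plain object as a stand-in for the EML document. The `ensurePublisher` name and the `{ dataset: { publisher } }` shape are assumptions for illustration and not the editor's actual model.

```javascript
// Hypothetical helper: return a document that is guaranteed to have a
// publisher. An existing, non-empty publisher is left untouched; otherwise the
// repository's publisher is copied in. The input object is never mutated.
function ensurePublisher(emlDoc, repositoryPublisher) {
  const existing = emlDoc.dataset && emlDoc.dataset.publisher;
  const hasPublisher = existing != null && Object.keys(existing).length > 0;
  if (hasPublisher) {
    return emlDoc;
  }
  return {
    ...emlDoc,
    dataset: { ...emlDoc.dataset, publisher: { ...repositoryPublisher } },
  };
}

const repositoryPublisher = {
  organizationName: "Arctic Data Center",
  userId: "Q77285095",
  onlineUrl: "https://arcticdata.io",
};

// A new document and an older document that was saved without a publisher
// both go through the same check.
const editedDoc = { dataset: { title: "Example dataset" } };
console.log(ensurePublisher(editedDoc, repositoryPublisher));
```

Running the same check for new documents and on every edit of an existing one covers both cases without a separate migration step.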
Here's our current metadata entry for the ADC, listing our various identifiers and other info from our schema.org entry on our home page:
I'm guessing the ROR is the best identifier to use these days.
When a dataset is submitted to a member node, the publisher information should automatically be inserted into the EML document. This will make our metadata more Accessible according to the FAIR metadata quality suite.
We should include an organization identifier in the userId field. "ROR or GRID or WIKIDATA would be good" - @mbjones