ESIPFed / science-on-schema.org

science-on-schema.org - providing guidance for publishing schema.org as JSON-LD for the sciences
Apache License 2.0
113 stars 33 forks source link

Repository metadata consistency with RDA recommendations #247

Open mbjones opened 11 months ago

mbjones commented 11 months ago

The RDA Data Repository Attributes working group is proposing repository metadata attributes to be used in describing repositories (see their Draft Version 5), and these overlap with the ESIP science-on-schema.org guidelines for repository metadata. I did a little crosswalk to see how consistent they are, and I worked up a document for the Arctic Data Center that illustrates usage.

Here's a comparison table of how the ESIP and RDA field lists correspond.

number label schema_mapping esip_recommended rda_recommended
1 Repository Name schema:name TRUE TRUE
2 URL schema:url TRUE TRUE
3 Country schema:addressCountry TRUE
4 Language schema:inLanguage TRUE
5 Institution schema:parentOrganization TRUE TRUE
6 Contact schema:contactPoint TRUE TRUE
7 Description schema:description TRUE TRUE
8 Research Area schema:category TRUE TRUE
9 Persistent Identifiers ex:persistentIdentifiers TRUE
10 Machine Interoperability ex:machineInteroperability TRUE
11 Metadata ex:metadata TRUE
12 Curation ex:curation TRUE
13 Terms of Deposit schema:termsOfService TRUE
14 Terms of Access ex:termsOfAccess TRUE
15 Dataset Use License schema:license TRUE
16 Certification schema:credentialCategory TRUE
17 Preservation Policy ex:preservationPolicy TRUE
18 Repository Legal Name schema:legalName TRUE
19 Repository Alternate Name schema:alternateName
20 Repository Logo URL schema:logo TRUE
21 Repository Linked Identifier @id TRUE
22 Repository Persistent Identifier schema:identifier TRUE
23 Repository Equivalent Identifier schema:sameAs TRUE
24 Repository Service Provider schema:provider TRUE
25 Founding Date schema:foundingDate TRUE
26 Funder schema:funder TRUE
27 Repository Catalog schema:hasOfferCatalog TRUE
28 Repository Address schema:address TRUE

And here's a schema.org document for the Arctic Data Center with a union of both sets of fields.

Expand for arctic-data-center.json metadata ```json { "@context": [ "https://schema.org/", { "ex": "https://example.org/" } ], "@type": [ "Service", "Organization", "ResearchProject" ], "@id": "https://arcticdata.io", "name": "Arctic Data Center", "legalName": "Arctic Data Center", "alternateName": "ADC", "logo": "https://arcticdata.io/wp-content/themes/aurora/library/images/logo_.png", "url": "https://arcticdata.io", "description": "The Arctic Data Center is the primary data and software repository for the Arctic section of NSF Polar Programs.", "identifier": [ { "@type": "PropertyValue", "name": "ROR:055hrh286", "propertyID": "https://registry.identifiers.org/registry/ror", "value": "ror:055hrh286", "url": "https://ror.org/055hrh286" }, { "@type": "PropertyValue", "name": "Re3data DOI: 10.17616/R37P98", "propertyID": "https://registry.identifiers.org/registry/doi", "value": "doi:10.17616/R37P98", "url": "https://doi.org/10.17616/R37P98" }, { "@type": "PropertyValue", "name": "wikidata:Q77285095", "propertyID": "https://registry.identifiers.org/registry/wikidata", "value": "wikidata:Q77285095", "url": "https://www.wikidata.org/wiki/Q77285095" }, { "@type": "PropertyValue", "name": "grid:grid.507882.0", "propertyID": "https://registry.identifiers.org/registry/grid", "value": "grid:grid.507882.0", "url": "https://www.grid.ac/institutes/grid.507882.0" } ], "sameAs": [ "https://ror.org/055hrh286", "https://www.grid.ac/institutes/grid.507882.0", "https://www.wikidata.org/wiki/Q77285095", "https://www.re3data.org/repository/r3d100011973", "http://doi.org/10.17616/R37P98", "urn:node:ARCTIC" ], "category": [ "Arctic Research" ], "provider": { "@id": "https://arcticdata.io" }, "contactPoint": { "@type": "ContactPoint", "name": "Support", "email": "support@arcticdata.io", "url": "https://arcticdata.io/support/", "contactType": "customer support" }, "foundingDate": "2016-02-01", "funder": { "@type": "Organization", "@id": "https://doi.org/10.13039/100000087", "legalName": "Office of Polar Programs", "alternateName": "OPP", "url": "https://www.nsf.gov/div/index.jsp?div=OPP", "identifier": { "@type": "PropertyValue", "propertyID": "https://registry.identifiers.org/registry/doi", "value": "doi:10.13039/100000087", "url": "https://doi.org/10.13039/100000087" }, "parentOrganization": { "@type": "Organization", "@id": "https://doi.org/10.13039/100000085", "legalName": "Directorate for Geosciences", "alternateName": "NSF-GEO", "url": "http://www.nsf.gov", "identifier": { "@type": "PropertyValue", "propertyID": "https://registry.identifiers.org/registry/doi", "value": "10.13039/100000085", "url": "https://doi.org/10.13039/100000085" }, "parentOrganization": { "@type": "Organization", "@id": "https://doi.org/10.13039/100000001", "legalName": "National Science Foundation", "alternateName": "NSF", "url": "http://www.nsf.gov", "identifier": { "@type": "PropertyValue", "propertyID": "https://registry.identifiers.org/registry/doi", "value": "10.13039/100000001", "url": "https://doi.org/10.13039/100000001" } } } }, "hasOfferCatalog": { "@type": "OfferCatalog", "name": "Arctic Data Center Data Catalog", "itemListElement": [ { "@type": "DataCatalog", "@id": "https://arcticdata.io/catalog/data", "name": "Arctic Data Center Catalog", "audience": { "@type": "Audience", "audienceType": "public", "name": "General Public" } } ] }, "address": { "@type": "PostalAddress", "streetAddress": "1021 Anacapa Street", "addressLocality": "Santa Barbara", "addressRegion": "CA", "postalCode": "93101", "addressCountry": "US" }, "parentOrganization": { "@type": "Organization", "@id": "https://ror.org/0146z4r19", "legalName": "National Center for Ecological Analysis and Synthesis", "alternateName": "NCEAS", "url": "http://nceas.ucsb.edu", "identifier": { "@type": "PropertyValue", "propertyID": "https://registry.identifiers.org/registry/ror", "value": "ror:0146z4r19", "url": "https://ror.org/0146z4r19" }, "parentOrganization": { "@type": "Organization", "@id": "https://ror.org/02t274463", "legalName": "University of California, Santa Barbara", "alternateName": "UCSB", "url": "http://ucsb.edu", "identifier": { "@type": "PropertyValue", "propertyID": "https://registry.identifiers.org/registry/ror", "value": "ror:02t274463", "url": "https://ror.org/02t274463" } } }, "inLanguage": "en-US", "addressCountry": "US", "license": [ "http://spdx.org/licenses/CC0-1.0", "https://spdx.org/licenses/CC-BY-4.0" ], "credentialCategory": "CoreTrustSeal", "termsOfService": [ "http://spdx.org/licenses/CC0-1.0", "https://spdx.org/licenses/CC-BY-4.0" ], "ex:persistentIdentifiers": [ "https://registry.identifiers.org/registry/doi", "https://registry.identifiers.org/registry/orcid", "https://registry.identifiers.org/registry/ror", "https://registry.identifiers.org/registry/rrid", "https://registry.identifiers.org/registry/d1id", "https://registry.identifiers.org/registry/ark" ], "ex:machineInteroperability": [ "DataONE", "OAI-PMH", "DataCite", "REST", "SPARQL" ], "ex:metadata": [ "EML", "ISO-19115", "DDI", "Dublin Core", "FGDC CSDGM", "METS", "DataCite", "OAI-ORE", "other" ], "ex:curation": "https://arcticdata.io/submit/", "ex:preservationPolicy": "https://arcticdata.io/preservation/", "ex:termsOfAccess": [ "http://spdx.org/licenses/CC0-1.0", "https://spdx.org/licenses/CC-BY-4.0" ] } ```

Note that I didn't have good mappings for the following terms, which I placed in the example namespace (ex) for now:

Thoughts on whether we should make any changes to be consistent?