NCEAS / metadig-checks

MetaDIG suites and checks for data and metadata improvement and guidance.
Apache License 2.0

Need check for schema.org discovery metadata #106

Open daniels303 opened 5 years ago

daniels303 commented 5 years ago

Description

Need to check for the existence of JSON-LD schema.org metadata as well.

Priority

Procedure

Checks can be made by fetching the dataset's landing page and looking for embedded schema.org JSON-LD metadata (typically a `<script type="application/ld+json">` element).
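A presence/absence check along these lines could be sketched as follows. This is only an illustration using the Python standard library, not code from metadig-checks; the names `JsonLdExtractor` and `has_schema_org_jsonld` are hypothetical, and it assumes the landing page HTML has already been fetched as a string.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collect the text of every <script type="application/ld+json"> element."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        # Script content may arrive in several chunks, so buffer it.
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def has_schema_org_jsonld(html):
    """Return True if the page embeds parseable JSON-LD with a schema.org context."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    for block in parser.blocks:
        try:
            doc = json.loads(block)
        except json.JSONDecodeError:
            continue  # malformed JSON-LD does not count as present
        nodes = doc if isinstance(doc, list) else [doc]
        for node in nodes:
            # @context may be a string, a list, or an object; search its serialized form.
            if isinstance(node, dict) and "schema.org" in json.dumps(node.get("@context", "")):
                return True
    return False
```

The HTML could come from something like `urllib.request.urlopen(landing_page_url).read().decode()`. Pages that inject JSON-LD with JavaScript after load would need a rendering step that this sketch does not cover.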

daniels303 commented 5 years ago

This needs a "findable" label...

mbjones commented 4 years ago

@gothub The ESIP science-on-schema.org cluster led by @ashepherd has been developing best-practice guidelines for including schema.org markup on Dataset landing pages. In addition to a presence/absence check, we should also verify compliance with the ESIP schema.org guidelines, which are about to be released as version 1.1, and then shortly thereafter as version 1.2.

The schema.org cluster is developing a set of SHACL shape constraints to express conformance rules, so a simple implementation would be to add a check to the check engine that runs the SHACL shapes against the harvested document. @datadavev has implemented such a checker for DataONE, and is working with @fils, who has a tool called Fence that performs similar actions. It would be good to incorporate these into the quality suite and be able to report on them. I think Dave's tool is a wrapper around pySHACL, so it should be fairly straightforward to add into one of our checks.
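As a rough illustration of what running SHACL shapes against a harvested document might look like, here is a minimal sketch using rdflib and pySHACL directly. It is not @datadavev's tool or Fence; the function names and the result dictionary (`status`, `output`) are hypothetical, and the shapes are assumed to be supplied as Turtle text.

```python
def summarize(conforms, report_text):
    """Map a SHACL validation outcome onto a simple check result (hypothetical shape)."""
    return {
        "status": "SUCCESS" if conforms else "FAILURE",
        "output": "Document conforms to the shapes." if conforms else report_text,
    }


def check_conformance(jsonld_text, shapes_ttl):
    """Validate a JSON-LD document against SHACL shapes given as Turtle."""
    # Third-party dependencies, imported lazily so summarize() stays usable without them.
    from rdflib import Graph
    from pyshacl import validate

    # Parsing JSON-LD with a remote @context may require network access.
    data_graph = Graph().parse(data=jsonld_text, format="json-ld")
    shapes_graph = Graph().parse(data=shapes_ttl, format="turtle")

    # validate() returns (conforms, results_graph, results_text).
    conforms, _results_graph, results_text = validate(data_graph, shacl_graph=shapes_graph)
    return summarize(conforms, results_text)
```

Keeping the reporting step separate from the validation call means the same result shape could be produced whichever validator ends up behind the check.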