tetherless-world / whyis

Whyis is a nano-scale knowledge graph publishing, management, and analysis framework.
Apache License 2.0
52 stars 30 forks source link

SHACL validation support in Whyis #237

Open mdeagen opened 3 years ago

mdeagen commented 3 years ago

Should we add a bounty for integrating a SHACL validator in Whyis?

A data curation pipeline could pass an incoming dataset through various SHACL shape graphs and display validation reports (warnings, etc.), to assist the curator in data cleaning or deciding whether a virtual column is needed (in the case of semantic data dictionary curation).

A new version of pySHACL was released a couple weeks ago https://github.com/RDFLib/pySHACL

jpmccu commented 3 years ago

I like this. The validator can mark the relevant nanopublications and link to the root entities that are examined. This would be a good first attempt at creating quality reports for the graph and do the initial modeling for that and documenting "issues" in the graph. Another one would be the Ontology hygiene checks (they're all sparql queries).

On Fri, Sep 18, 2020 at 10:51 AM mdeagen notifications@github.com wrote:

Should we add a bounty for integrating a SHACL validator in Whyis?

A data curation pipeline could pass an incoming dataset through various SHACL shape graphs and display validation reports (warnings, etc.), to assist the curator in data cleaning or deciding whether a virtual column is needed (in the case of semantic data dictionary curation).

A new version of pySHACL was released a couple weeks ago https://github.com/RDFLib/pySHACL

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/tetherless-world/whyis/issues/237, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAETCEJPAEQIIJAAOHNVU5TSGNXYDANCNFSM4RSDGZSQ .

-- Jamie McCusker (they/them/theirs)

Director, Data Operations Tetherless World Constellation Rensselaer Polytechnic Institute mccusj2@rpi.edu mccusj@cs.rpi.edu http://tw.rpi.edu