Swirrl / table2qb

A generic pipeline for converting tabular data into rdf data cubes
Eclipse Public License 1.0
13 stars 4 forks source link

Provide ways to hide/redirect csvw metadata in output #85

Closed RickMoynihan closed 5 years ago

RickMoynihan commented 5 years ago

e.g. we see lots of stuff like this... I think because some of these pipelines are run with new data, the blank nodes and csvw:url's are growing at quite a rate... some of our datasets probably contain a few hundred, and counting at least.

screen shot 2018-11-07 at 22 02 39

I know this output is sometimes useful, so it might be worth trying to redirect into a different graph which we can optionally load. Or maybe options for running in csvw minimal mode?

Robsteranium commented 5 years ago

We can't really use that audit data while it's full of blank nodes and references to temp files. Eventually it would be great to store the uploaded csvw and use this together with validations to point to broken cells etc. For now though, it's clutter.

Issue #96 would see us record all resource data in the csv tables, allowing us to use minimal mode.

In the mean time we need to use the custom annotated mode from here https://github.com/Swirrl/csv2rdf/issues/17.

Robsteranium commented 5 years ago

Fixed with #104.