Swirrl / table2qb

A generic pipeline for converting tabular data into rdf data cubes
Eclipse Public License 1.0

Remove rdfs:range from components-pipeline #100

Open Robsteranium opened 5 years ago


The use of rdfs:range and the classize transformation for describing components was taken from the sns-graft pipeline, where we also apply a class to the codes as part of a single loading pipeline.

I had originally hoped this would support validation, but I have since learned that rdfs:range is actually used to make inferences, not to validate constraints. In other words:

- given the tbox: dim:age rdfs:range class:Age
- and an abox with (invalid data): obs1 dim:age code:Male
- inference will give you (false, but consistent data): code:Male a class:Age
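To make the point concrete, here is a minimal sketch of the RDFS range entailment rule (rdfs3) applied to the example above. It uses plain Python tuples rather than a real RDF library, and the function name `infer_range_types` is hypothetical, introduced only for illustration. Note that the rule never rejects anything; it only adds a type statement.

```python
# Triples are plain (subject, predicate, object) tuples of prefixed names.
# This is an illustrative sketch, not table2qb code.
RDF_TYPE = "rdf:type"
RDFS_RANGE = "rdfs:range"


def infer_range_types(tbox, abox):
    """Apply the RDFS range entailment rule (rdfs3) once.

    For every (p rdfs:range C) in the tbox and every (s p o) in the abox,
    entail (o rdf:type C). No triple is ever rejected as invalid.
    """
    ranges = {(s, o) for (s, p, o) in tbox if p == RDFS_RANGE}
    return {
        (obj, RDF_TYPE, cls)
        for (prop, cls) in ranges
        for (_subj, pred, obj) in abox
        if pred == prop
    }


tbox = {("dim:age", RDFS_RANGE, "class:Age")}
abox = {("obs1", "dim:age", "code:Male")}  # invalid: Male is not an age

inferred = infer_range_types(tbox, abox)
print(inferred)  # {('code:Male', 'rdf:type', 'class:Age')}
```

The invalid observation passes through untouched and the only effect is the extra (false) statement that code:Male is a class:Age.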

I now think we should validate integrity constraints by another means (see #97 for more).
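As a rough illustration of what validating rather than inferring could look like, the sketch below performs a closed-world check: a value is accepted only if the data already declares it a member of the expected class. This is an assumption about the general shape of such a check, not the approach discussed in #97, and the names `range_violations`, `code:Adult` and `class:Sex` are hypothetical.

```python
# Closed-world range check over plain (subject, predicate, object) tuples.
# Illustrative sketch only; not table2qb code.
def range_violations(constraints, data):
    """Return triples whose object is not explicitly typed with the expected class.

    `constraints` is a list of (property, expected_class) pairs.
    """
    expected = dict(constraints)
    typed = {(s, o) for (s, p, o) in data if p == "rdf:type"}
    return sorted(
        (s, p, o)
        for (s, p, o) in data
        if p in expected and (o, expected[p]) not in typed
    )


constraints = [("dim:age", "class:Age")]
data = {
    ("code:Adult", "rdf:type", "class:Age"),  # hypothetical valid age code
    ("code:Male", "rdf:type", "class:Sex"),   # hypothetical class for sex codes
    ("obs1", "dim:age", "code:Male"),         # invalid observation
    ("obs2", "dim:age", "code:Adult"),        # valid observation
}

violations = range_violations(constraints, data)
print(violations)  # [('obs1', 'dim:age', 'code:Male')]
```

Unlike the range inference, this check reports obs1 as a problem instead of silently asserting a new type for code:Male.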

We should therefore remove the rdfs:range statements, the classize function, and any related code.