pldn / LDWizard

🧙 LDWizard: A generic framework for simplifying the creation of linked data. Supported by the PLDN community.
European Union Public License 1.2

Validate datatype IRIs for columns #21

Open GerwinBosch opened 4 years ago

GerwinBosch commented 4 years ago

Use case

As a user, I want the datatype IRIs that are suggested to me to be limited to those that actually apply to the values in the corresponding column. When I enter a datatype IRI myself, I want to be notified if it would result in incorrect linked data.
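A minimal sketch of what such a check could look like, assuming column values arrive as plain strings. The names `lexicalPatterns` and `findInvalidValues` are hypothetical and not part of LDWizard, and the regular expressions are simplified versions of the XSD lexical forms (no time zones, no calendar validation).

```typescript
// Hypothetical helper, not part of LDWizard: checks column values against
// simplified lexical patterns for a handful of XSD datatypes.
const XSD = "http://www.w3.org/2001/XMLSchema#";

const lexicalPatterns: Record<string, RegExp> = {
  [`${XSD}integer`]: /^[+-]?\d+$/,
  [`${XSD}decimal`]: /^[+-]?(\d+(\.\d*)?|\.\d+)$/,
  [`${XSD}boolean`]: /^(true|false|1|0)$/,
  [`${XSD}date`]: /^-?\d{4,}-(0[1-9]|1[0-2])-(0[1-9]|[12]\d|3[01])$/,
  [`${XSD}gYearMonth`]: /^-?\d{4,}-(0[1-9]|1[0-2])$/,
};

interface InvalidValue {
  row: number; // zero-based index of the value within the column
  value: string;
}

// Returns the values that do not match the lexical pattern of the datatype.
// Datatypes unknown to this sketch are not checked and yield an empty list.
function findInvalidValues(values: string[], datatypeIri: string): InvalidValue[] {
  const pattern = lexicalPatterns[datatypeIri];
  if (pattern === undefined) return [];
  const invalid: InvalidValue[] = [];
  values.forEach((value, row) => {
    const trimmed = value.trim();
    if (trimmed === "") return; // assumption: empty cells count as missing, not invalid
    if (!pattern.test(trimmed)) invalid.push({ row, value });
  });
  return invalid;
}
```

A non-empty result could then be used to warn the user, for example by listing the first few offending rows next to the datatype input.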

To implement

Blocked by netwerk-digitaal-erfgoed/LDWizard#42

wouterbeek commented 4 years ago

I can also imagine a situation in which it is simply not possible to select a datatype IRI if the data does not conform to the denoted datatype (as we currently do for key column selection). Then this validation would not be needed. See pldn/LDWizard#24 for the automatic detection of possible datatype IRIs based on the source data.

philipperenzen commented 1 year ago

Additional implementation (from #24)

For example, columns whose values all have the format YYYY-MM-DD could be typed as xsd:date, and columns that contain only whole numbers could be typed as xsd:integer.

See https://github.com/DHLab-nl/guess_xsd_type for an existing auto-detection approach.
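The detection idea described above could be sketched as follows. This is not how guess_xsd_type or LDWizard work internally; `candidateDatatypes` and `suggestDatatypes` are hypothetical names, and the patterns are deliberately simplified. The sketch keeps only those datatypes whose pattern matches every non-empty value in the column.

```typescript
const XSD_NS = "http://www.w3.org/2001/XMLSchema#";

// Candidates ordered from more specific to more general.
// xsd:string is the fallback because every value conforms to it.
const candidateDatatypes: Array<{ iri: string; matches: (v: string) => boolean }> = [
  { iri: `${XSD_NS}integer`, matches: (v) => /^[+-]?\d+$/.test(v) },
  { iri: `${XSD_NS}decimal`, matches: (v) => /^[+-]?(\d+(\.\d*)?|\.\d+)$/.test(v) },
  { iri: `${XSD_NS}date`, matches: (v) => /^\d{4}-(0[1-9]|1[0-2])-(0[1-9]|[12]\d|3[01])$/.test(v) },
  { iri: `${XSD_NS}string`, matches: () => true },
];

// Returns the IRIs of all candidate datatypes that every non-empty value conforms to.
function suggestDatatypes(values: string[]): string[] {
  const nonEmpty = values.map((v) => v.trim()).filter((v) => v !== "");
  // Without any values nothing can be inferred, so only offer the fallback.
  if (nonEmpty.length === 0) return [`${XSD_NS}string`];
  return candidateDatatypes
    .filter((candidate) => nonEmpty.every((v) => candidate.matches(v)))
    .map((candidate) => candidate.iri);
}
```

The returned list could be used both to populate the suggestions and to restrict which datatype IRIs are selectable for a column.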

To discuss: what should happen if a single column contains values of several different datatypes? For example: "01-01-1780" (xsd:date) and "02-1230" (xsd:gYearMonth).
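To make the question concrete, here is a rough sketch that counts which datatype each value in a column looks like. It assumes the source values use day-month-year and month-year order, as in the two examples; that reading is an assumption. Such values would still need to be rewritten to the XSD lexical forms (YYYY-MM-DD and YYYY-MM) before being emitted as literals. The names `sourceFormats` and `tallyDatatypes` are hypothetical.

```typescript
// Assumed source formats, mapped to the datatype they would end up as.
const sourceFormats: Array<[string, RegExp]> = [
  ["xsd:date", /^\d{2}-\d{2}-\d{4}$/], // e.g. "01-01-1780", read as DD-MM-YYYY
  ["xsd:gYearMonth", /^\d{2}-\d{4}$/], // e.g. "02-1230", read as MM-YYYY
];

// Counts, per datatype, how many values in the column match its source format.
// Values that match none of the formats are counted as xsd:string.
function tallyDatatypes(values: string[]): Map<string, number> {
  const counts = new Map<string, number>();
  for (const value of values) {
    const hit = sourceFormats.find(([, pattern]) => pattern.test(value.trim()));
    const datatype = hit ? hit[0] : "xsd:string";
    counts.set(datatype, (counts.get(datatype) ?? 0) + 1);
  }
  return counts;
}

// A column is mixed when more than one datatype was detected.
function isMixedColumn(values: string[]): boolean {
  return tallyDatatypes(values).size > 1;
}
```

With a tally like this, the options to discuss become clearer: fall back to a single general datatype for the whole column, assign a datatype per cell, or ask the user to decide when `isMixedColumn` returns true.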