ontodev / valve.rs

A lightweight validation engine written in Rust.
BSD 3-Clause "New" or "Revised" License

Prototype 'guess' operation #61

jamesaoverton closed 8 months ago

jamesaoverton commented 10 months ago

VALVE will be much easier to use once we can provide a TSV file and automatically fill the 'column' table with reasonable guesses. Let's design that feature.

Assume that we start with the usual VALVE special tables, working properly. The user provides the path to a new TSV file to add (not a special table, just a data table). Assume that this file could be quite large. Here's my first pass at the next steps:

I'm still fuzzy on which of the rules should have priority.
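Since the file could be quite large, one possible first step is to sample a bounded number of rows and collect the distinct values per column, which any guessing rule could then inspect. The following is only a minimal sketch under that assumption; `sample_distinct_values` and `max_rows` are hypothetical names, not part of VALVE, and the naive tab splitting ignores quoting.

```rust
use std::collections::{BTreeMap, BTreeSet};
use std::io::BufRead;

/// Hypothetical helper (not a VALVE API): read the header row plus at most
/// `max_rows` data rows from a TSV source, and collect the distinct values
/// seen in each column, keyed by header label.
pub fn sample_distinct_values<R: BufRead>(
    reader: R,
    max_rows: usize,
) -> std::io::Result<BTreeMap<String, BTreeSet<String>>> {
    let mut lines = reader.lines();

    // The first line is assumed to be the header row.
    let headers: Vec<String> = match lines.next() {
        Some(line) => line?.split('\t').map(|h| h.to_string()).collect(),
        None => return Ok(BTreeMap::new()),
    };

    let mut distinct: BTreeMap<String, BTreeSet<String>> = headers
        .iter()
        .map(|h| (h.clone(), BTreeSet::new()))
        .collect();

    // Only the sampled rows are read, so memory use is bounded by the
    // sample size rather than the file size.
    for line in lines.take(max_rows) {
        let line = line?;
        for (header, value) in headers.iter().zip(line.split('\t')) {
            if let Some(set) = distinct.get_mut(header) {
                set.insert(value.to_string());
            }
        }
    }
    Ok(distinct)
}
```

Taking the first rows is the simplest sampling strategy; a random sample would be less biased by sort order but needs a pass over the whole file.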

lmcmicu commented 9 months ago

Note that since from() structures always refer to an external table, we cannot "check the distinct values for the other columns in this table" in the algorithm specified above (but the heading "check the distinct values for the other columns in other tables already loaded" is fine).
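To illustrate the constraint described here, the sketch below proposes `from()` candidates only from columns in other, already-loaded tables, and skips any column that belongs to the table being guessed. It is a rough illustration, not the project's implementation: `LoadedColumn` and `guess_from_candidates` are hypothetical names, ignoring empty values is an assumption, and the `from(table.column)` strings are built purely for display.

```rust
use std::collections::BTreeSet;

/// Hypothetical record: one column of an already-loaded table together
/// with its distinct values.
pub struct LoadedColumn {
    pub table: String,
    pub column: String,
    pub values: BTreeSet<String>,
}

/// Hypothetical helper (not a VALVE API): return a `from(table.column)`
/// candidate for every loaded column that contains all non-empty values
/// of the target column. Columns in `target_table` itself are skipped,
/// because from() always refers to an external table.
pub fn guess_from_candidates(
    target_table: &str,
    target_values: &BTreeSet<String>,
    loaded: &[LoadedColumn],
) -> Vec<String> {
    // Assumption: empty cells are not evidence for or against a from().
    let non_empty: Vec<&String> = target_values
        .iter()
        .filter(|v| !v.is_empty())
        .collect();
    if non_empty.is_empty() {
        return Vec::new();
    }

    loaded
        .iter()
        .filter(|c| c.table != target_table)
        .filter(|c| non_empty.iter().all(|v| c.values.contains(*v)))
        .map(|c| format!("from({}.{})", c.table, c.column))
        .collect()
}
```

When several candidates come back, choosing among them is exactly the open priority question raised earlier in the thread; this sketch deliberately returns all of them.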