NRGI / resource-projects-etl

ETL processes for rp.org
GNU General Public License v2.0
3 stars 2 forks source link

Considering provenance vs. auditing trail #4

Closed anderspeders closed 8 years ago

anderspeders commented 9 years ago

Provenance can be documented by sources from across time.

The auditing trail should include:

timgdavies commented 9 years ago

We should soon have provenance on a number of levels:

(1) In the input templates, most rows ask for the Source to be given, linking to the class to a given source;

(2) The conversion process logs which sheet and row data came from;

(3) Each imported file is loaded into it's own named graph, which allows us to identify where each item of data originated;

timgdavies commented 8 years ago

Pages all have a Source which gives provenance to the graph, source and file number.