NRGI / resource-projects-etl

ETL processes for rp.org
GNU General Public License v2.0

Ignore provenance data for certain classes #33

Closed timgdavies closed 8 years ago

timgdavies commented 8 years ago

Right now the provenance features of TagLifter.py mean that every entity discovered in a file is linked to each row of the file it is found in.

So, for example, a single country might end up with hundreds of prov:wasDerivedFrom records.

We should have a mechanism, either at load time or at the point of data creation, to drop provenance data for certain classes.
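As a rough illustration of the load-time option, the sketch below filters provenance links out of an already-generated set of triples. It is not TagLifter code: the function name `drop_provenance`, the plain-tuple triple representation, and the `ex:` identifiers are all assumptions made for the example.

```python
# Hypothetical load-time filter; names and data are illustrative only.
RDF_TYPE = "rdf:type"
PROV_DERIVED = "prov:wasDerivedFrom"


def drop_provenance(triples, skip_classes):
    """Drop prov:wasDerivedFrom links whose subject is typed as one of skip_classes."""
    triples = list(triples)
    # Collect every subject whose rdf:type is one of the classes to skip.
    skipped_subjects = {s for s, p, o in triples if p == RDF_TYPE and o in skip_classes}
    # Keep everything except provenance links on those subjects.
    return [
        (s, p, o)
        for s, p, o in triples
        if not (p == PROV_DERIVED and s in skipped_subjects)
    ]


triples = [
    ("ex:country/GH", "rdf:type", "ex:Country"),
    ("ex:country/GH", "prov:wasDerivedFrom", "ex:source/row/1"),
    ("ex:country/GH", "prov:wasDerivedFrom", "ex:source/row/2"),
    ("ex:project/1", "rdf:type", "ex:Project"),
    ("ex:project/1", "prov:wasDerivedFrom", "ex:source/row/1"),
]
filtered = drop_provenance(triples, {"ex:Country"})
```

Here the country keeps its type statement but loses both row links, while the project's provenance is left untouched.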

timgdavies commented 8 years ago

TagLifter now respects the captureProvenance property on classes. Where not set, or false, provenance information is ignored.
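
The behaviour described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual TagLifter implementation: the `entity_triples` function, the dictionary-based class definitions, and the `ex:` identifiers are hypothetical, and only the `captureProvenance` property name comes from this thread.

```python
# Hypothetical creation-time check; not the real TagLifter code.
def entity_triples(entity_uri, class_def, row_uri):
    """Build triples for one entity, adding provenance only when the class asks for it."""
    triples = [(entity_uri, "rdf:type", class_def["uri"])]
    # A missing or false captureProvenance means no provenance link is written.
    if class_def.get("captureProvenance", False):
        triples.append((entity_uri, "prov:wasDerivedFrom", row_uri))
    return triples


country = {"uri": "ex:Country"}  # captureProvenance not set
project = {"uri": "ex:Project", "captureProvenance": True}

country_triples = entity_triples("ex:country/GH", country, "ex:source/row/1")
project_triples = entity_triples("ex:project/1", project, "ex:source/row/1")
```

With these definitions the country yields only its type statement, while the project also gets a prov:wasDerivedFrom link to the source row.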