LOD-Laundromat / lodlaundry.github.io

http://lodlaundromat.org
2 stars 2 forks source link

Improve schema #104

Open LaurensRietveld opened 9 years ago

LaurensRietveld commented 9 years ago

This ticket is for whenever we do a major overhaul of lod laundromat.

Things to improve: We should distinguish between -our- dataset identifier (e.g. http://lodlaundromat.org/resource/<md5>), and an identifier to the original data source (e.g. http://lodlaundromat.org/resource/<md5>/source, with a same-as relation to the original download url). There is a provenance relation between both, we should model it like that. Some properties (e.g. # warnings ) belong to the provenance relation Other properties (e.g. statementstype) belong to -our- dataset identifier. And properties such as linecount belong to the source dataset identifier

LaurensRietveld commented 9 years ago

This is now done in a post-process sparql update query. for the new crawl, make sure we change this directly in the washing machine