LOD-Laundromat / lodlaundry.github.io

http://lodlaundromat.org
2 stars 2 forks source link

Base URI not set #89

Open LaurensRietveld opened 9 years ago

LaurensRietveld commented 9 years ago

When a base URI is not set, we might end up with URIs such as

<http:/scratch/lodlaundromat/crawls/12/69e7c7ccdc8f0b373325d5acf3c27b26/dirty>

I.e., the file path of the dirty file is used as base uri.

Preferably, when a document does not contains a base URI, add one ourselves. My suggestion:

`http://lodlaundromat.org/.base/<md5