The SPARQL service description offers information about a SPARQL service which should be used by the crawler. The following triples contain the URI http://dbpedia.org/sparql as well as the information that this URI should be crawled as SPARQL endpoint:
:DBpedia a sd:Service;
sd:endpoint <http://dbpedia.org/sparql> .
This issue overlaps with #118
Solution
At the moment, the RDF processing is not very intelligent and simply stores all newly retrieved URIs. We would have to enhance the processing, e.g., by using the decorator pattern and decorator classes which handle special cases like that. However, depending on the number of special cases we will have in the future, the decorator pattern might become to heavy.
Additionally, the storage of newly found URIs might be an issue. In the example above, it wouldn't be sufficient to add the type information since the URI could already have been found before and the update might be rejected by the store. The same holds for the Frontier's queue which might reject the appending of the new URI since it might already be known. In this case, update strategies might be necessary.
Description
The SPARQL service description offers information about a SPARQL service which should be used by the crawler. The following triples contain the URI
http://dbpedia.org/sparql
as well as the information that this URI should be crawled as SPARQL endpoint:This issue overlaps with #118
Solution