ailabitmo / sempubchallenge2014-task1

A solution for Task 1 of the Semantic Publishing Challenge
MIT License

How to configure and run the parser

Required modules:

The following Python modules need to be installed:

Configuration

All configuration settings go in the config.py file, which is created by renaming config.py.example.
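The step above can be sketched in Python for setups where a shell rename is inconvenient. This is only an illustration: the helper name create_config and its directory parameter are hypothetical, and it copies the example file instead of renaming it so that the template stays in place.

```python
import shutil
from pathlib import Path


def create_config(directory="."):
    """Create config.py from config.py.example in the given directory.

    Hypothetical helper, not part of the project. It copies rather than
    renames, and it never overwrites an existing config.py.
    """
    base = Path(directory)
    example = base / "config.py.example"
    target = base / "config.py"
    if target.exists():
        # Keep any settings that were already edited by hand.
        return target
    shutil.copyfile(example, target)
    return target


if __name__ == "__main__":
    # Assumes the script is run from the directory holding config.py.example.
    print(create_config("."))
```

After the file exists, edit the settings in config.py as described in the sections that follow.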

Input URLs

The input URLs are set as a Python list in the input_urls variable.
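A minimal sketch of what this setting might look like in config.py. The two URLs are placeholders chosen for illustration, not values taken from the project; which pages the parser actually accepts is an assumption here. The check_urls helper is also hypothetical and only guards against a common mistake (assigning a single string instead of a list).

```python
# config.py (sketch): the URLs below are illustrative placeholders.
input_urls = [
    "http://ceur-ws.org/Vol-1/",
    "http://ceur-ws.org/Vol-2/",
]


def check_urls(urls):
    """Hypothetical sanity check: urls must be a list of http(s) strings."""
    if not isinstance(urls, list):
        raise TypeError("input_urls must be a list, not %s" % type(urls).__name__)
    for url in urls:
        if not isinstance(url, str) or not url.startswith(("http://", "https://")):
            raise ValueError("not an http(s) URL: %r" % (url,))
    return True


check_urls(input_urls)
```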

DBpedia dataset (with countries and universities)

The parser uses DBpedia to extract the names of countries and universities, along with their DBpedia URIs.

There are three options:

Run

Once you have finished the configuration, execute the following script:

python CeurWsParser/spider.py

The dataset will be written to the rdfdb.ttl file.
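To confirm that a run produced output, a quick check of rdfdb.ttl can help. The sketch below is a rough line-based heuristic using only the standard library, not a real Turtle parser; the function name summarize_turtle is hypothetical, and the location of rdfdb.ttl relative to the working directory is an assumption.

```python
from pathlib import Path


def summarize_turtle(path):
    """Return (prefix_count, statement_line_count) for a Turtle file.

    Heuristic only: counts prefix declarations and lines ending in '.',
    which approximates the number of completed statements.
    """
    prefixes = 0
    statements = 0
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if line.lower().startswith(("@prefix", "prefix ")):
            prefixes += 1
        elif line.endswith("."):
            statements += 1
    return prefixes, statements


if __name__ == "__main__":
    output = Path("rdfdb.ttl")  # assumed to be in the current directory
    if output.exists():
        print(summarize_turtle(output))
    else:
        print("rdfdb.ttl not found; has the spider been run?")
```

For real validation or querying, an RDF library that parses Turtle properly is the better choice.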

Queries

The SPARQL queries for Task 1 are translations of the challenge's human-readable queries, written against our data model. The queries are in the wiki.

Contacts

Maxim Kolchin (kolchinmax@gmail.com)

Fedor Kozlov (kozlovfedor@gmail.com)