dbpedia / GSoC

Google Summer of Code organization

Workflow for linking external datasets #5

Closed (mgns closed 4 years ago)

mgns commented 6 years ago

Description

DBpedia has long been known as a central hub in the Linked Open Data (LOD) cloud: numerous datasets are linked from DBpedia, and even more link to DBpedia resources. Nevertheless, additional outgoing links are highly appreciated. In the DBpedia Links repository, people can upload their linksets or the scripts that generate them. What is currently missing is a toolset to create links between DBpedia and external datasets automatically and to review them if needed. State-of-the-art tools for link creation between datasets are SILK and LIMES. Given a new dataset that should be linked to DBpedia, the student should create a workflow for automatic link generation and for curation of the automatically generated links. This workflow should be supported by a web-based GUI.

There are two levels of linking:

  1. schema level: linking external vocabularies to the DBpedia ontology
  2. instance level: linking resources from external datasets to DBpedia resources

Both levels can be considered in this project.
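
To make the intended workflow more concrete, here is a minimal Python sketch of the instance-level case: generate candidate links, accept the confident ones automatically, and queue the uncertain ones for manual review. It is only an illustration under stated assumptions, not the project's design and not how SILK or LIMES work internally. The sample URIs under `example.org`, the label-only comparison with `difflib`, the function names, and the two thresholds (`accept`, `review`) are all hypothetical choices. Schema-level linking would presumably follow the same pattern with a different link predicate.

```python
from difflib import SequenceMatcher

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"

# Hypothetical sample data: resource URI -> label.
DBPEDIA = {
    "http://dbpedia.org/resource/Berlin": "Berlin",
    "http://dbpedia.org/resource/Leipzig": "Leipzig",
}
EXTERNAL = {
    "http://example.org/place/1": "Berlin",
    "http://example.org/place/2": "Leipzig (city)",
    "http://example.org/place/3": "Paris",
}


def similarity(a, b):
    """Case-insensitive string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def generate_links(source, target, accept=0.9, review=0.6):
    """Pick the best-matching target for each source resource.

    Links scoring at least `accept` are accepted automatically; links
    scoring at least `review` (but below `accept`) go to manual review;
    everything else is dropped. Both thresholds are assumed values.
    """
    accepted, to_review = [], []
    for s_uri, s_label in source.items():
        best_uri, best_score = None, 0.0
        for t_uri, t_label in target.items():
            score = similarity(s_label, t_label)
            if score > best_score:
                best_uri, best_score = t_uri, score
        if best_uri is None:
            continue
        link = (s_uri, best_uri, best_score)
        if best_score >= accept:
            accepted.append(link)
        elif best_score >= review:
            to_review.append(link)
    return accepted, to_review


def to_ntriples(links):
    """Serialize (source, target, score) links as owl:sameAs N-Triples."""
    return "\n".join(f"<{s}> <{OWL_SAME_AS}> <{t}> ." for s, t, _ in links)


accepted, to_review = generate_links(DBPEDIA, EXTERNAL)
print(to_ntriples(accepted))
for s, t, score in to_review:
    print(f"REVIEW {s} -> {t} (score {score:.2f})")
```

With this sample data, the exact label match is accepted directly, while the partial match "Leipzig" / "Leipzig (city)" falls between the two thresholds and is queued for review; a web-based GUI could present exactly that review queue to a curator.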

Goals

Impact

Get better links from DBpedia to external datasets.

Warm up tasks

Mentors

Keywords

data quality, linking

umairq commented 5 years ago

Could you please suggest some warm-up tasks and mentors? Thanks.