dbpedia / extraction-framework

The software used to extract structured data from Wikipedia
850 stars 270 forks source link

topical_concepts should use foaf:focus not skos:subject #301

Open VladimirAlexiev opened 9 years ago

VladimirAlexiev commented 9 years ago

also on mlist

topical_concepts gives the topical page for each category, eg

dbp:Category:Programming_languages skos:subject dbp:Programming_language

But there's no such property. I suggest to use foaf:focus "The underlying or 'focal' entity associated with some SKOS-described concept".

The same prop is used by Getty TGN, VIAF, FR BnF, UK BL.

VladimirAlexiev commented 9 years ago

However, now the data is gone from dbpedia.org: this query returns only a few thousand. select * {?x skos:subject ?y} But the dataset http://data.dws.informatik.uni-mannheim.de/dbpedia/2014/en/topical_concepts_en.ttl.bz2 has a lot more lines.

This one finds no topical link: select * {dbcat:Programming_languages ?p dbr:Programming_language }

And could someone explain where this skos:related came from? dbr:Category:Programming_languages skos:related dbr:Category:Programming_language_topics

IMHO the topical links are quite important for some classification tasks. E.g. WiBitaxonomy could use them strongly to tie up the page vs category hierarchy.

VladimirAlexiev commented 9 years ago

Any progress?