Closed m1ci closed 8 years ago
I guess we can divide this up into:
@nilesh-c this is follow up task after #44 so having the scoring by 15th is not very possible.
Some useful stats regarding the topics in DBpedia:
918.188 distinct topics
SELECT COUNT(DISTINCT ?o)
WHERE {
?s <http://purl.org/dc/terms/subject> ?o .
}
SPARQL query sorted by number of entities with the topic
SELECT ?class (COUNT(?s) AS ?count ) WHERE {
?s <http://purl.org/dc/terms/subject> ?class
} GROUP BY ?class ORDER BY DESC(?count)
looks good to me
I think this issue is dead. lets close it.
This action is follow up of https://github.com/freme-project/e-Entity/issues/44
We need to define mechanism for scoring topics assigned to each entity. Only topics which are most relevant for the entity/document should be part of the output.