Open cygri opened 10 years ago
There are three parts to this:
This comment focuses on the second point.
The key is the LOVWrapper
class. This is a wrapper around a VocabularyTermExtractor
. The VocabularyTermExtractor
iterates over class/property descriptions extracted from an RDFS/OWL Model. Now the LOVWrapper
modifies these descriptions with LOV-specific stuff. For example, it adds a “vocabulary” field to the JSON with information about the vocabulary that defines the term. Here you could also add scoring information. The best way to do that is probably:
Describer
(similar to TermDescriber
) that adds scores for a given class/property. Perhaps call it TermLOVScoreDescriber
or somesuch.Describer
in the LOVWrapper
constructor, and invoke it in modifyDocument()
Describer
, you will need to pass the SPARQLRunner
from LOVExtractor
to LOVWrapper
so that the Describer
has access to the full LOV dataset including the scoring information in named graphs.
LOV contains some statistics on vocabulary usage that should be excellent for improved ranking.