shilad / wikibrain

The WikiBrain Java library enables researchers and developers to incorporate state-of-the-art Wikipedia-based algorithms and technologies in a few lines of code.
http://shilad.github.io/wikibrain/

CategorySimilarity performance improvements #238

Open shilad opened 9 years ago

shilad commented 9 years ago

The call that fetches the categories associated with a page in the Bfs searcher class is slow. Use the graph object instead, or implement a special cosimilarity method that takes an approach like mostSimilar.
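
A rough sketch of the second idea, for discussion only. It assumes the page/category links are already loaded into an in-memory adjacency map, so no per-page category lookup happens inside the search. `CategoryGraphSketch`, `distancesFrom`, and the `1 / (1 + distance)` score are hypothetical names and choices made for illustration; they are not the WikiBrain API or the scoring that CategorySimilarity actually uses.

```java
import java.util.ArrayDeque;
import java.util.HashMap;
import java.util.Map;
import java.util.Queue;

/** Hypothetical in-memory category graph; not a WikiBrain class. */
public class CategoryGraphSketch {
    // Undirected adjacency: page or category id -> neighbouring ids.
    // Assumed to be built once, up front, from the category links.
    private final Map<Integer, int[]> neighbors;

    public CategoryGraphSketch(Map<Integer, int[]> neighbors) {
        this.neighbors = neighbors;
    }

    /** Breadth-first distances from start, following at most maxDepth edges. */
    public Map<Integer, Integer> distancesFrom(int start, int maxDepth) {
        Map<Integer, Integer> dist = new HashMap<>();
        Queue<Integer> queue = new ArrayDeque<>();
        dist.put(start, 0);
        queue.add(start);
        while (!queue.isEmpty()) {
            int node = queue.remove();
            int d = dist.get(node);
            if (d == maxDepth) {
                continue; // do not expand beyond the depth limit
            }
            for (int next : neighbors.getOrDefault(node, new int[0])) {
                if (!dist.containsKey(next)) {
                    dist.put(next, d + 1);
                    queue.add(next);
                }
            }
        }
        return dist;
    }

    /**
     * Cosimilarity matrix: one search per row id, then each column is a map lookup.
     * The score 1 / (1 + distance) is an illustrative stand-in; unreachable pairs get 0.
     */
    public double[][] cosimilarity(int[] rows, int[] cols, int maxDepth) {
        double[][] result = new double[rows.length][cols.length];
        for (int i = 0; i < rows.length; i++) {
            Map<Integer, Integer> dist = distancesFrom(rows[i], maxDepth);
            for (int j = 0; j < cols.length; j++) {
                Integer d = dist.get(cols[j]);
                result[i][j] = (d == null) ? 0.0 : 1.0 / (1.0 + d);
            }
        }
        return result;
    }
}
```

The point of the sketch is the cost shape: an n-by-m cosimilarity call runs n searches instead of n*m, and none of them touch the category lookup that is slow today.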