asoroa / ukb

Ukb: graph-based WSD and similarity
105 stars 28 forks source link

UKB: Graph Based Word Sense Disambiguation and Similarity

Build Status License: GPL v3

UKB is a collection of programs for performing graph-based Word Sense Disambiguation and lexical similarity/relatedness using a pre-existing knowledge base. Currently, UKB runs in linux and macos.

UKB has been developed by the IXA group in the University of the Basque Country. UKB applies the so-called Personalized PageRank on a Lexical Knowledge Base (LKB) to rank the vertices of the LKB and thus perform disambiguation. The details of the method are described in [1]. It has also been applied on WSD on specific domains [2,5], Named Entity Disambiguation [6] and to obtain graph embeddings [7] . The algorithm can also be used to calculate lexical similarity/relatedness of words/sentences. See [3,4,6] for applications of UKB to similarity.

Visit http://ixa2.si.ehu.es/ukb/ for more information about UKB.

The latest source code for git can be found here:

https://github.com/asoroa/ukb.git

References

Files under this catalogue:

Check README files in the respective catalogue.

Getting the resources

WordNet

Wikipedia

Replicating UKB results

The "wsdeval/" directory contains the scripts to reproduce UKB results evaluated in the WSD Evaluation Framework dataset. See "wsdeval/README" and follow the instructions.