getalp / UFSAC

UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them
MIT License
37 stars 4 forks source link

UFSAC: Unification of Sense Annotated Corpora and Tools

This repository contains the dataset of the article named "UFSAC: Unification of Sense Annotated Corpora and Tools", written by Loïc Vial, Benjamin Lecouteux and Didier Schwab, for the 11th edition of the Language Resources and Evaluation Conference (LREC) that took place in May 2018 in Miyazaki, Japan.

The full article is available at the following URL: http://www.lrec-conf.org/proceedings/lrec2018/summaries/250.html.

Content of the repository

This repository contains:

Get Started

If you want to use the Java API or the scripts, the prerequisites are:

Once they are installed, you must compile the code:

And if you want to use the library as a dependency in another Maven projects:

Version history

Version 2.1 (October 2018)

Direct link to the data: https://drive.google.com/file/d/1kwBMIDBTf6heRno9bdLvF-DahSLHIZyV

Version 2.0.0 (July 2018)

Version 1.1.0 (June 2018)

Direct link to the data: https://drive.google.com/file/d/1XKOnRPnm0TSia1PKwe2xsGE4IDqvAAbb

Version 1.0.0 (May 2018)

Direct link to the data: https://drive.google.com/file/d/1-II0demgruLdSdI8SC6dmnIqDNrZvdpW

Original version which contains the following corpora:

Plus the code to produce the UFSAC version from the original version of the following corpora: