Ron89 / thesaurus_query.vim

Multi-language Thesaurus Query and Replacement plugin for Vim/NeoVim
http://www.vim.org/scripts/script.php?script_id=5341
Apache License 2.0

New Thesaurus source: YARN -- WordNet-like machine-readable thesaurus for the Russian language #18

Closed petRUShka closed 7 years ago

petRUShka commented 7 years ago

There is a crowdsourcing project: https://russianword.net/en/

Quotation:

The Yet Another RussNet (YARN) project was initiated in 2013 and aims at creating a large open WordNet-like machine-readable thesaurus for the Russian language through crowdsourcing. The project objectives include creating the thesaurus, developing free and libré open source software to operate with it, designing the necessary data schemes and models, writing technical and user documentation. The thesaurus is available under the CC BY-SA license.

There is a public HTTP API (unfortunately, the only documentation I've found is in Russian): https://nlpub.ru/YARN/API

A Google Translate version of that documentation (quite readable): https://translate.google.com/translate?hl=ru&sl=ru&tl=en&u=https%3A%2F%2Fnlpub.ru%2FYARN%2FAPI

It should be a very good and stable solution for the Russian language.

petRUShka commented 7 years ago

P.S. The thesaurus is also available as downloadable CSV or XML files. The format and download links are described on this small wiki page (in English): https://nlpub.ru/YARN/Format

Ron89 commented 7 years ago

Took a look at the links you shared. I can't figure out how the online API works, and the git repository was last updated years ago. But the offline source http://russianword.net/yarn-synsets.csv looks very workable. I'll come up with something next week.

Thanks for the suggestion.
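
For illustration only, here is a minimal sketch of how a synonym lookup against a synset CSV could work. The column names (`id`, `words`, `grammar`, `domain`), the `;` separator inside the `words` column, and the sample rows are all assumptions made for this example; the actual layout of yarn-synsets.csv is described on the YARN format page and may differ. This is not the plugin's implementation.

```python
import csv
import io

# Hypothetical sample data. The real yarn-synsets.csv layout may differ;
# the header names and the ";" word separator are assumptions.
SAMPLE = """id,words,grammar,domain
1,дом;здание;строение,n,
2,быстрый;скорый,a,
"""


def load_synsets(handle, word_column="words", separator=";"):
    """Read a CSV file object and return a list of synsets (lists of words)."""
    synsets = []
    for row in csv.DictReader(handle):
        words = [w.strip() for w in row[word_column].split(separator) if w.strip()]
        if words:
            synsets.append(words)
    return synsets


def synonyms(word, synsets):
    """Collect the other members of every synset that contains `word`."""
    result = []
    for words in synsets:
        if word in words:
            result.extend(w for w in words if w != word and w not in result)
    return result


synsets = load_synsets(io.StringIO(SAMPLE))
print(synonyms("дом", synsets))
```

A linear scan like this is enough to show the idea; for a large file, building a word-to-synsets dictionary once would avoid rescanning on every query.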

Ron89 commented 7 years ago

Sorry for the long delay. As of commit 9124403, the thesaurus backend based on yarn-synsets.csv is finished and takes priority over jeck_ru. To use it, please download yarn-synsets.csv into the folder ~/.vim/thesaurus. If it works well for you, please mark this issue as resolved.
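
The download step can be scripted. Below is a minimal sketch using only the Python standard library; the URL and the ~/.vim/thesaurus folder come from this thread, while the helper name `fetch_thesaurus` is hypothetical and the URL may no longer be reachable.

```python
import shutil
import urllib.request
from pathlib import Path

# URL and target folder as given in this thread; availability is not guaranteed.
YARN_URL = "http://russianword.net/yarn-synsets.csv"
THESAURUS_DIR = Path.home() / ".vim" / "thesaurus"


def fetch_thesaurus(url=YARN_URL, dest_dir=THESAURUS_DIR):
    """Download `url` into `dest_dir` (hypothetical helper) and return the file path."""
    dest_dir = Path(dest_dir)
    # Create the thesaurus folder if it does not exist yet.
    dest_dir.mkdir(parents=True, exist_ok=True)
    # Keep the remote file name, e.g. "yarn-synsets.csv".
    target = dest_dir / url.rsplit("/", 1)[-1]
    with urllib.request.urlopen(url) as response, open(target, "wb") as out:
        shutil.copyfileobj(response, out)
    return target
```

Calling `fetch_thesaurus()` with no arguments would attempt to save the file as ~/.vim/thesaurus/yarn-synsets.csv.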

Ron89 commented 7 years ago

No feedback for a long time, so I now consider this issue resolved.