nltk / wordnet

Stand-alone WordNet API
Other
48 stars 15 forks source link

Note of related work: Finnish WordNet adaptation for nltk's WordNet module #9

Open frankier opened 5 years ago

frankier commented 5 years ago

FinnWordNet has been incorporated into OMW of course, but firstly it's and old version, and secondly FinnWordNet is actually aligned with PWN at the lemma level rather than the synset level.

The problem is is that FinnWordNet adds other bits of markup into its WordNet index files.

Here it is: https://github.com/frankier/finntk/blob/master/finntk/wordnet/reader.py

One thing that might be nice here is to here -- assuming you have any interest -- is to make it so it's possible to do this kind of overriding with less copypasting. Another possibility would be to add FinnWordNet support directly to this repository.

I'm not really sure what's best here but I thought I'd ask.

alvations commented 5 years ago

Let me see what I can do in the code. Any documentation of what are the extra markup in Finn WN?