own-pt / cl-wnbrowser

A collaborative editor for OpenWordnet-PT.
http://openwordnet-pt.org
Other
7 stars 8 forks source link

Incorporate polysemy information #103

Closed fcbr closed 4 years ago

fcbr commented 9 years ago

per @vcvpaiva

Need to define how to incorporate this into the current UI / db.

arademaker commented 9 years ago

More info would be nice here. I didn't understand the request.

vcvpaiva commented 9 years ago

According to https://www.aclweb.org/anthology/W/W14/W14-0102.pdf PWN has 118.695 synsets 206.979 words. 82.3 % of synsets is monossemic= 97,329 synsets. But when using OWN-PT and investigating PWN I find only 63,848 synsets with a single word.
This is not a problem per se, but it would be good to know which ones are the monosemic synsets in English and whether they are the same as the ones in Portuguese. This will help establish mappings from English to Portuguese and other languages, as monosemic synsets are very easy to translate. the problem is to discover how many they are and how polysemic the rest of the synsets are.

arademaker commented 4 years ago

It is not clear what is this issue. I am closing it but will open one about the last comment from valeria