own-pt / own-en-legacy

The openWordnet-EN, a converted and expanded PWN
MIT License

We should remove suffixes from labels #16

Closed fcbr closed 7 years ago

fcbr commented 7 years ago

I just imported a partial conversion to RDF and noticed that the labels of the words also contain the suffixes used to disambiguate the sense ids. These suffixes should be removed from the labels. Pay attention to "words" that are simply numbers!

Example:

<https://w3id.org/own-pt/wn30-en/instances/word-snappy4> 
      <https://w3id.org/own-pt/wn30/schema/lexicalForm> "snappy4" .

should be:

<https://w3id.org/own-pt/wn30-en/instances/word-snappy4> 
      <https://w3id.org/own-pt/wn30/schema/lexicalForm> "snappy" .
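
A minimal sketch of the kind of clean-up meant here, in Python. It assumes the disambiguation suffix is always a trailing run of digits and that a label made only of digits is a number "word" to be left alone; the function name `strip_sense_suffix` is hypothetical and is not taken from the conversion code.

```python
import re

# Assumption: the sense suffix is a trailing run of digits that follows
# at least one non-digit character.
_TRAILING_DIGITS = re.compile(r"^(?P<stem>.*\D)(?P<suffix>\d+)$")


def strip_sense_suffix(label: str) -> str:
    """Drop a trailing run of digits unless the label is purely numeric."""
    match = _TRAILING_DIGITS.match(label)
    if match is None:
        # Either there are no trailing digits, or the label has no
        # non-digit character at all (a "word" that is simply a number).
        return label
    return match.group("stem")


print(strip_sense_suffix("snappy4"))  # snappy
print(strip_sense_suffix("snappy"))   # snappy
print(strip_sense_suffix("1940"))     # 1940
```

This heuristic cannot tell a suffix from digits that belong to the word itself (a lemma that really ends in a digit, or a number word that also received a suffix), so a real conversion would probably need to remove the known suffix of each sense id rather than guess from the label.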

arademaker commented 7 years ago

Is it really necessary to expose the 4?

The word should be

<https://w3id.org/own-pt/wn30-en/instances/word-snappy> 
      <https://w3id.org/own-pt/wn30/schema/lexicalForm> "snappy" .

don't you think?

fcbr commented 7 years ago

That means that we would share words between senses? That may work...

hmuniz commented 7 years ago

I guess commits e1a6511b3a0ae28fb4c821216259092cfd8ca179 and 07519963f769e2ba5010546fec081c7ed83cee72 removed the suffixes from the labels.

arademaker commented 7 years ago

@fcbr yes, a word should have only one instance, connected to more than one sense when the word is polysemous. Is this related to #17? Can we close this issue?
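
To illustrate the one-word-instance-per-lexical-form idea, here is a rough Python sketch. The sense identifiers and the helper name `group_senses_by_word` are hypothetical; only the `word-` IRI prefix comes from the examples in this thread, and percent-encoding the lexical form is an assumption about how non-ASCII or spaced lemmas might be handled.

```python
from collections import defaultdict
from urllib.parse import quote

# IRI prefix as it appears in the examples in this issue.
WORD_BASE = "https://w3id.org/own-pt/wn30-en/instances/word-"


def group_senses_by_word(senses):
    """Map each word IRI to the list of sense ids that share its lexical form.

    `senses` is an iterable of (sense_id, lexical_form) pairs, where the
    lexical form has already had any disambiguation suffix removed.
    """
    words = defaultdict(list)
    for sense_id, lexical_form in senses:
        # Assumption: the lexical form is percent-encoded to build the IRI.
        word_iri = WORD_BASE + quote(lexical_form, safe="")
        words[word_iri].append(sense_id)
    return dict(words)


# Hypothetical sense ids, for illustration only.
senses = [
    ("sense-a", "snappy"),
    ("sense-b", "snappy"),
    ("sense-c", "crisp"),
]
for word_iri, sense_ids in group_senses_by_word(senses).items():
    print(word_iri, sense_ids)
```

With this grouping, a polysemous word such as "snappy" ends up as a single word instance referenced by several senses, instead of one suffixed word instance per sense.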