Open yakazimir opened 4 years ago
Just an update: in terms of nltk's wordnet mapping using synset_from_sense_key
, something seems to be wrong.
Your gloss/id pair is consistent with wordnet when I searched here: http://wordnetweb.princeton.edu/perl/webwn?s=climate&sub=Search+WordNet&o2=&o0=1&o8=1&o1=1&o7=&o5=&o9=&o6=1&o3=&o4=&h=0000 .
This issue is mentioned here: https://github.com/nltk/nltk/issues/1934
I'm trying to rebuild your data, and noticed in the ALL.dict.xml (which, as I understand, contains all of the lemmas, glosses and word senses used in all the semeval data), you have entries such as the following:
Where climate#n is the lemma and pos. It says here that the
sence_count_wn=2
, however, there is only one sense inside of lexelt. Shouldn't there be all of the 2 sense entries inside oflexelt
? My assumption is that each lexelt should have all of the different WN senses and glosses of the lemma listed initem
.I also notice that when I look up this word in nltk's wordnet (which I see that you also use), I get a different definition for
climate%1:26:00::
: