saad120 / dkpro-wsd

Automatically exported from code.google.com/p/dkpro-wsd

Evaluators fail if the input is not POS-tagged #41

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
The cluster evaluator assumes the input is POS-tagged and fails in
getTestAndGoldResults() if it is not. It stores results in a hash table
keyed by part of speech, so when the POS is null we get a
NullPointerException.

The problem may exist for other evaluators as well.
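
For illustration, the failure mode can be reproduced in isolation. This is only a minimal sketch, not the evaluator's actual code: the `POS` enum, the `record` helper, and the choice of `java.util.Hashtable` are assumptions made for the example, since the issue does not show which map type `getTestAndGoldResults()` uses. (`java.util.HashMap` accepts a null key, whereas `Hashtable` and `EnumMap` reject it.)

```java
import java.util.Hashtable;
import java.util.Map;

public class NullPosKeyDemo {
    // Hypothetical stand-in for the project's POS enumeration.
    public enum POS { NOUN, VERB, ADJ, ADV }

    // Hypothetical helper: counts one result under the given POS,
    // mimicking per-POS bookkeeping in an evaluator.
    public static void record(Map<POS, Integer> counts, POS pos) {
        counts.merge(pos, 1, Integer::sum);
    }

    // Returns true if recording an untagged (null-POS) item throws an NPE.
    public static boolean failsOnNullPos() {
        // Assumption: a map type that rejects null keys.
        Map<POS, Integer> counts = new Hashtable<>();
        record(counts, POS.NOUN);   // tagged input works
        try {
            record(counts, null);   // untagged input: the key is null
            return false;
        } catch (NullPointerException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println("NPE on null POS: " + failsOnNullPos());
    }
}
```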

Original issue reported on code.google.com by tristan.miller@nothingisreal.com on 16 Oct 2013 at 10:45

GoogleCodeExporter commented 9 years ago
Attached test case.

Original comment by tristan.miller@nothingisreal.com on 16 Oct 2013 at 10:49


GoogleCodeExporter commented 9 years ago
The problem also affects AbstractSingleExactMatchEvaluator.

Original comment by tristan.miller@nothingisreal.com on 17 Oct 2013 at 12:40

GoogleCodeExporter commented 9 years ago
A possible solution would be to add a new element to the POS enumeration to
represent a missing or unknown POS. However, we would need to carefully check
the rest of our code to make sure nothing relies on there being only "real"
parts of speech.
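
A rough sketch of that idea follows. All names here are hypothetical (`UNKNOWN`, `orUnknown`, `tally`, and this `POS` enum are not taken from dkpro-wsd), and it assumes the evaluators could normalise a null POS before using it as a map key.

```java
import java.util.EnumMap;
import java.util.Map;

public class UnknownPosDemo {
    // Hypothetical POS enumeration; UNKNOWN is the proposed extra element.
    public enum POS { NOUN, VERB, ADJ, ADV, UNKNOWN }

    // Maps a missing tag to UNKNOWN so that it is always a valid map key.
    public static POS orUnknown(POS pos) {
        return pos == null ? POS.UNKNOWN : pos;
    }

    // Counts items per POS; untagged items are grouped under UNKNOWN.
    public static Map<POS, Integer> tally(POS... tags) {
        Map<POS, Integer> counts = new EnumMap<>(POS.class);
        for (POS tag : tags) {
            counts.merge(orUnknown(tag), 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // Mixed tagged and untagged input no longer throws.
        System.out.println(tally(POS.NOUN, null, POS.NOUN, null, null));
    }
}
```

The caveat above still applies to this sketch: any code that iterates over `POS.values()` or switches on the enum would now encounter `UNKNOWN` and would need to be reviewed.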

Original comment by tristan.miller@nothingisreal.com on 17 Oct 2013 at 12:49

GoogleCodeExporter commented 9 years ago

Original comment by tristan.miller@nothingisreal.com on 20 Nov 2013 at 3:03

GoogleCodeExporter commented 9 years ago

Original comment by tristan.miller@nothingisreal.com on 4 Nov 2014 at 11:45