Closed GoogleCodeExporter closed 9 years ago
The *base classes shouldn't be used to specify the feature set, but only the
ones that end in *DFE.
Apart from that, there might really be an issue with this FeatureExtractor.
There is no test for it, so it might have gone unnoticed.
Original comment by daxenber...@gmail.com
on 5 Jun 2014 at 10:48
LucenePOSNGramDFE uses the variable topKSet (from NGramFeatureExtractorBase)
which is computed by getTopNgrams()(abstract in NGramFeatureExtractorBase).
Then getTopNgrams is overridden in LuceneNgramFeatureExtractorBase but not in
LuceneNgramFeatureExtractorBase, I guess the issue comes from this.
Original comment by narassig...@gmail.com
on 5 Jun 2014 at 11:45
This issue was updated by revision r869.
The class hierarchy of the LuceneNgram Feature Extractors was inconsistent. I
changed it to be consistent with LucenePOSNGram FeatureExtractors etc.
LucenePOSNGramFeatureExtractor has been working fine as shown in the
corresponding test, but you need to set different parameters to configure it
(as also shown in the test). I guess, this was causing the unexpected results
here.
Original comment by daxenber...@gmail.com
on 5 Jun 2014 at 2:39
The problem wasn't LucenePOSNGramFeatureExtractorBase, but rather a wrong usage
of parameters, I suppose. Please re-open if the problems persists.
Original comment by daxenber...@gmail.com
on 5 Jun 2014 at 2:41
Original issue reported on code.google.com by
narassig...@gmail.com
on 5 Jun 2014 at 10:17