DrDub / cleartk

Automatically exported from code.google.com/p/cleartk
0 stars 0 forks source link

example POS tagger example uses preceding and following stems rather than words #306

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
I'm pretty sure that I found that using preceding and following word stems does 
not work as well as using the actual words themselves.  I changed this in the 
DefaultFeatureExtractor in the org.cleartk.token.pos.impl package.  

This is really low priority because it is, after all, an example part-of-speech 
tagger.  But it's easy enough to fix.  

Original issue reported on code.google.com by philip.o...@oracle.com on 10 May 2012 at 8:50

GoogleCodeExporter commented 9 years ago
I went ahead and fixed this - even if it makes the example slightly less 
interesting.  Since having stem features in a part-of-speech tagger is just 
plain wrong - I don't like having it in the code we send users to in our 
tutorial which is where they will likely start.

Original comment by philip.o...@oracle.com on 10 May 2012 at 11:03

GoogleCodeExporter commented 9 years ago

Original comment by steven.b...@gmail.com on 5 Aug 2012 at 8:47