Closed GoogleCodeExporter closed 9 years ago
I've committed this API under org.cleartk.classifier.chunking. I'd like to
deprecate the old chunker. What do you guys think?
For comparison, look at the changes in TimeAnnotator:
http://code.google.com/p/cleartk/source/browse/trunk/cleartk-timeml/src/main/jav
a/org/cleartk/timeml/time/TimeAnnotator.java?r=3887
http://code.google.com/p/cleartk/source/browse/trunk/cleartk-timeml/src/main/jav
a/org/cleartk/timeml/time/TimeAnnotator.java?r=3888
I think the biggest improvement is in just being able to write code like you
would for any other CleartkSequenceAnnotator (that and getting rid of a crazy
number of hard-to-understand UIMA parameters).
Original comment by steven.b...@gmail.com
on 21 Apr 2012 at 12:35
Steve,
This looks really great. I was actually looking at the chunker recently and
got discouraged because it was so complicated - and I wrote it! This looks way
easier and I am happy for you to deprecate the old approach.
This might be a good time to rip out the chunk tokenizer which I doubt anyone
is using. For something like this it might suffice to send out an email to the
user's list to see if anyone cares about it and if no one responds, then we
simply remove it.
Original comment by phi...@ogren.info
on 21 Apr 2012 at 6:01
This issue was closed by revision r3889.
Original comment by steven.b...@gmail.com
on 21 Apr 2012 at 6:53
I deprecated the old chunker classes, as well as the chunk tokenizer. I've also
opened Issue 303 to make sure that we eventually delete the chunk tokenizer.
It's probably good practice to leave it in, deprecated, for one release before
we rip it out.
Original comment by steven.b...@gmail.com
on 21 Apr 2012 at 6:57
Original comment by steven.b...@gmail.com
on 5 Aug 2012 at 8:50
Original issue reported on code.google.com by
steven.b...@gmail.com
on 19 Apr 2012 at 3:26