komalsahedani / cleartk

Automatically exported from code.google.com/p/cleartk

cleartk-opennlp-tools should just be a thin layer over opennlp.uima #40

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
The UIMA wrappers being developed by the OpenNLP folks are really nice.
Basically, we can rip out our lame POS tagger wrapper and replace it with a
descriptor file that specifies our type system.

See http://opennlp.cvs.sourceforge.net/viewvc/opennlp/opennlp.uima/

We will likely need to build the code ourselves for now, keeping careful
track of which revision we are using and any changes we've made.

Original issue reported on code.google.com by pvogren@gmail.com on 6 Feb 2009 at 2:56

GoogleCodeExporter commented 9 years ago

Original comment by pvogren@gmail.com on 27 Oct 2009 at 2:52

GoogleCodeExporter commented 9 years ago
I took another look at this and couldn't quite figure out how to bind the
resource to the AED (analysis engine description). I think this will be much
easier to accomplish once the Apache version is released: the relevant code in
opennlp-uima has been refactored since the 1.4.3 release, and I didn't feel
like tracking down the 1.4.3 code.

Anyway, the plan is that the current PosTaggerAnnotator class will be
completely gutted, except that a new getDescription() method will be added.
I will commit some rough sketches of the new version of this method, commented
out.

Original comment by pvogren@gmail.com on 14 Jan 2011 at 8:04

GoogleCodeExporter commented 9 years ago

Original comment by pvogren@gmail.com on 14 Jan 2011 at 9:22

GoogleCodeExporter commented 9 years ago
This looks relatively easy to do now with external resources in uimaFIT 1.3.
I committed an example, for the OpenNLP Tokenizer, in r3462. It basically just
plugs the ClearTK type system names into the OpenNLP annotator parameters.
(BTW, I put the new type in the package org.cleartk.opennlp instead of
org.cleartk.syntax.opennlp so that it matches the restructuring we agreed upon
for the maven modules.)

We should do the same for the SentenceDetector, POSTagger, NameFinder, Chunker 
and Parser.
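
To make the "thin layer" idea concrete, here is a rough sketch of the mapping involved. It is not the code from r3462. It uses only the JDK, so it shows the shape of the configuration without calling uimaFIT or opennlp.uima. The parameter names (opennlp.uima.SentenceType, opennlp.uima.TokenType) and the ClearTK type names (org.cleartk.token.type.Sentence, org.cleartk.token.type.Token) are assumptions from memory and should be checked against the versions actually in use. The class name OpenNlpTypeMapping is hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/**
 * Sketch only: collects the settings an OpenNLP UIMA tokenizer would need in
 * order to read and write ClearTK annotation types. All names below are
 * assumptions and must be verified against the opennlp.uima and ClearTK
 * releases in use.
 */
public class OpenNlpTypeMapping {

    // Assumed opennlp.uima configuration parameter names.
    static final String SENTENCE_TYPE_PARAM = "opennlp.uima.SentenceType";
    static final String TOKEN_TYPE_PARAM = "opennlp.uima.TokenType";

    // Assumed ClearTK type system names.
    static final String CLEARTK_SENTENCE = "org.cleartk.token.type.Sentence";
    static final String CLEARTK_TOKEN = "org.cleartk.token.type.Token";

    /** Maps each tokenizer parameter to the ClearTK type it should use. */
    static Map<String, String> tokenizerParameters() {
        Map<String, String> params = new LinkedHashMap<>();
        params.put(SENTENCE_TYPE_PARAM, CLEARTK_SENTENCE);
        params.put(TOKEN_TYPE_PARAM, CLEARTK_TOKEN);
        return params;
    }

    /** Flattens the map into alternating name, value entries. */
    static Object[] toConfigurationData(Map<String, String> params) {
        Object[] data = new Object[params.size() * 2];
        int i = 0;
        for (Map.Entry<String, String> entry : params.entrySet()) {
            data[i++] = entry.getKey();
            data[i++] = entry.getValue();
        }
        return data;
    }

    public static void main(String[] args) {
        for (Map.Entry<String, String> entry : tokenizerParameters().entrySet()) {
            System.out.println(entry.getKey() + " = " + entry.getValue());
        }
    }
}
```

The flattened name/value array is the form in which uimaFIT factory methods take configuration data, so a wrapper's getDescription() could pass it along with the model bound as an external resource. The other annotators would presumably need the same treatment with their own parameters (for example, the feature that holds the POS tag).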

Original comment by steven.b...@gmail.com on 22 Jan 2012 at 11:38

GoogleCodeExporter commented 9 years ago

Original comment by steven.b...@gmail.com on 25 Jul 2012 at 12:59

GoogleCodeExporter commented 9 years ago

Original comment by lee.becker on 15 Jan 2013 at 7:30

GoogleCodeExporter commented 9 years ago

Original comment by steven.b...@gmail.com on 3 May 2013 at 8:44

GoogleCodeExporter commented 9 years ago

Original comment by phi...@ogren.info on 15 Mar 2014 at 5:41