-
-
Currently, to use TcAnnotator in a production environment, one has to wrap it in another UIMA component which delegates to the TcAnnotator in its `process` method and after running it picks up the `Te…
-
The `maximum Frequency` value is not reseted when `clear()` is called which might lead to a wrong `most frequent sample`.
https://github.com/dkpro/dkpro-core/blob/master/dkpro-core-api-frequency-as…
-
Hi! When trying to use StanfordPosTagger on an es-language CAS, we get a crash in dkpro, both with 1.8.0 and master when trying to load the model resource.
The resource seems to be provisioned fine:
…
pasky updated
6 years ago
-
All numeric feature values should be in the range of 0..1 or -1..1, especially for SVMs. The boolean nature of most features fulfills the former range but especially length/count features are not norm…
-
The information whether an item was split due to camel case notation is lost once the CamelCaseTokenSegmenter has split the tokens. It will require at least an additional traversal using the same logi…
-
The Jenkins build on zoidberg is failing with the following errors. It doesn't seem to have anything to do with recent commits; the build fails even when reverting to the last successfully built commi…
-
I think it would be a good idea to mark component parameters which could pose security problems, e.g. because they accept paths/directory names and would access the file system or classpath. While the…
-
Add keras as deep learning framework. Keras (https://keras.io) is a more easy to use deep learning project which uses Theano or Tensorflow in the background. It is furthermore a Python implementation.…
-
The MLA to be used should be defined as dimension. This would allow running several classifiers, not just parametrization of the same classifier.
This is essentially already possible by using an ou…