ClearTK / cleartk

Machine learning components for Apache UIMA
http://cleartk.github.io/cleartk/
Other
129 stars 58 forks source link

NGram / Frequency queries like DKPro #291

Open bethard opened 9 years ago

bethard commented 9 years ago

Original issue 293 created by ClearTK on 2012-03-02T15:10:27.000Z:

I was looking at the DKPro functionality, and I saw this page about how to work with NGrams in DKPro http://code.google.com/p/dkpro-core-asl/wiki/WorkingWithNGrams

While we have all of this functionality wrapped up in feature extractors, sometimes it's nice to just work with the raw ngrams. So the question for this issue is should we be implementing our own version of components like NGramIterable and FrequencyCountProvider in ClearTK?

Of course if we shared a common type system, we could just use the DKPro utilities directly.

bethard commented 9 years ago

Comment #1 originally posted by ClearTK on 2012-07-19T09:14:46.000Z:

The resources should be type-agnostic, so you should be able to use that as you like in ClearTK. You should be able to use the Web1tFormatWriter with ClearTK as well, just make sure you set the mandatory PARAM_INPUT_TYPES (which seems not to have been mandatory at the time the example liked above had been written).

bethard commented 9 years ago

Comment #2 originally posted by ClearTK on 2012-07-24T17:35:58.000Z:

<empty>

bethard commented 9 years ago

Comment #3 originally posted by ClearTK on 2013-05-03T08:44:33.000Z:

<empty>

bethard commented 9 years ago

Comment #4 originally posted by ClearTK on 2014-03-15T17:41:52.000Z:

<empty>