The dkpro-core-examples package demonstrates the use of DKPro Core components such as readers, annotators, and writers. Each module in this project corresponds to one DKPro Core component and provides a simple pipeline that is usable as is.
This branch uses DKPro Core version 1.9.0-SNAPSHOT.
So far, dkpro-core-examples comprises the following examples:
TokenizedTextWriter, which writes all tokens from all documents separated by whitespace, one sentence per line; it can be used to prepare data for external tools such as Word2Vec.
FrequencyCounter and PhraseAnnotator, which detect lexical phrases in a frequency-based manner.

If you have any questions or problems with these examples, we are happy to help. This is a tutorial project, so we are glad to improve things and make life easier for both new and experienced DKPro Core users. The easiest ways to get in touch are the DKPro Core mailing lists or submitting an issue.
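
To make the output format of the TokenizedTextWriter example more concrete (tokens separated by whitespace, one sentence per line), here is a minimal plain-Java sketch. It does not use DKPro Core or UIMA and is not the component's implementation; the class name TokenizedLinesSketch and the method format are hypothetical names chosen for illustration, and the input is assumed to be already split into sentences and tokens.

```java
import java.util.List;
import java.util.stream.Collectors;

// Illustration only: mimics the described output layout, not the DKPro Core component.
class TokenizedLinesSketch {

    /** Joins the tokens of each sentence with single spaces and puts one sentence on each line. */
    static String format(List<List<String>> sentences) {
        return sentences.stream()
                .map(tokens -> String.join(" ", tokens))
                .collect(Collectors.joining("\n"));
    }

    public static void main(String[] args) {
        // Assumed input: two sentences that have already been tokenized.
        List<List<String>> sentences = List.of(
                List.of("This", "is", "a", "sentence", "."),
                List.of("Here", "is", "another", "one", "."));
        System.out.println(format(sentences));
    }
}
```

Text in this layout can be fed line by line to external tools that expect pre-tokenized input, such as Word2Vec.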