korpling / pepperModules-TreetaggerModules

This project provides an im- and an exporter to support the TreeTagger format in the linguistic converter framework Pepper (see http://corpus-tools.org/pepper/). The TreeTagger is a natural language processing tool, to annotate text with part-of-speech and lemma annotations. A detailed description of the importer can be found in section TreeTaggerImporter and a description for the exporter can be found TreeTaggerExporter.
Other
0 stars 1 forks source link

Support whitespace tokens #14

Open amir-zeldes opened 6 years ago

amir-zeldes commented 6 years ago

Currently whitespace tokens (e.g. a line containing just a space) are not imported.

amir-zeldes commented 6 years ago

This is relevant for gitdox AP export, @ctschroeder @eplatte