GoogleCodeExporter closed 9 years ago
- Improved handling of a dying TreeTagger process.
- Added a setting to control the maximum token length (in bytes); the default
is 90000.
- Empirically determined that, at least on my machine, the maximum token length
is 99998 bytes. I expect that there is a 100000-byte buffer in TreeTagger; this
corresponds to 99998 one-byte characters + line break + a terminating zero byte
(end of string in C).
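
The buffer arithmetic above, and a byte-based length check, can be sketched as follows. This is only an illustration: the class and method names are hypothetical and not part of the wrapper's actual API, the 100000-byte buffer is the guess stated above rather than something confirmed from TreeTagger's source, and UTF-8 is assumed as the encoding used to talk to the process.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not the wrapper's real API.
final class TokenLengthCheck {
    // Assumed buffer size, inferred from the observation in the comment above.
    static final int ASSUMED_BUFFER_BYTES = 100_000;

    // One byte is reserved for the line break and one for the terminating zero byte.
    static final int OBSERVED_MAX_TOKEN_BYTES = ASSUMED_BUFFER_BYTES - 2;

    // Default limit mentioned in the changelog above.
    static final int DEFAULT_MAX_TOKEN_BYTES = 90_000;

    // Length of the token in bytes when encoded as UTF-8 (assumed encoding).
    static int utf8Length(String token) {
        return token.getBytes(StandardCharsets.UTF_8).length;
    }

    // True if the encoded token does not exceed the given byte limit.
    static boolean fits(String token, int maxBytes) {
        return utf8Length(token) <= maxBytes;
    }
}
```

Because the limit is counted in bytes rather than characters, a token made of multi-byte characters reaches it with fewer characters than a pure ASCII token would.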
Original comment by richard.eckart on 3 Jun 2011 at 9:46
Original issue reported on code.google.com by richard.eckart on 5 May 2011 at 8:20