reckart / tt4j

TreeTagger for Java
http://reckart.github.io/tt4j/
Apache License 2.0
16 stars 7 forks source link

Consider filtering out very long tokens #3

Closed reckart closed 9 years ago

reckart commented 9 years ago

Original issue 3 created by reckart on 2011-05-05T08:20:11.000Z:

It seems that TT has problems with very long tokens. Consider writing a test checking what the maximum token size is and subsequently add code to tt4j that ignores such long tokens.

reckart commented 9 years ago

Comment #1 originally posted by reckart on 2011-06-03T21:46:11.000Z: