A tokenizer and sentence splitter for German and English web and social media texts.
GNU General Public License v3.0
135
stars
21
forks
source link
Added roman ordinals, abbreviation "Art." preceding numbers #23
Closed
AndreasBlombach closed 1 year ago
As discussed. :)