Open ybracke opened 1 year ago
There can also be "word-internal" quotation marks with hyphenated words:
Original:
an- " ruffet
This should be normalized as:
anruft "
See discussion with Susanne on mattermost
Tokens in the DTA may be interrupted. This can be a (1) line-break (2) a line-break + a quotation mark, (3) ...?
Here is an example