Closed vanatteveldt closed 8 years ago
Fixed by generaring automatically the sentence identifiers "1|text text text". Alpino only gives special meaning to the first occurrence of the symbol "|", so the rest of symbols in the text do not raise any exception
Sentence containing a pipe symbol / vertical bar (
|
) are not processed correctly. Alpino uses this character to indicate line id's, so if a sentence contains a pipe the left hand side is treated as an id, containing the1.xml
to not be found, and the parse is not included in the output:This results in the following output (without terms):
Note that this is not an error condition, so the "not found the file" does not raise an exception (which it probably should?)
(this seems to be the root cause of https://github.com/ixa-ehu/ixa-pipe-nerc/issues/11)