These are fixes for some problem encountered while parsing some of our TextGrids:
Sometimes broken line-by-line decoding of n-byte encodings like UTF-16.
Inability to parse texts with newlines in them.
Inner text quotes escaped by doubling remaining doubled.
PR a) reworks line-by-line decoding of text format TextGrids to whole file decoding, b) enables parsing of multiline texts containing arbitrary number of newlines by repeatedly looking at more and more lines until the whole text is completed, and c) properly turns double-escaped quotes inside each read text back into single quotes after the text is read.
These are fixes for some problem encountered while parsing some of our TextGrids:
PR a) reworks line-by-line decoding of text format TextGrids to whole file decoding, b) enables parsing of multiline texts containing arbitrary number of newlines by repeatedly looking at more and more lines until the whole text is completed, and c) properly turns double-escaped quotes inside each read text back into single quotes after the text is read.