Character escaping for comments (in alpino_dependency.py and convert_penn_to_kaf.py) currently replaces '--' with '-'. This leads to a ValueError with lxml.etree when documents contain longer dash sequences, e.g., '------': a single replacement pass turns '------' into '---', which still contains '--' and is therefore not allowed inside an XML comment.
Perhaps we could use '&ndash;' as a replacement for '--'?
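
For illustration, here is a minimal sketch of an alternative that collapses a dash run of any length in one step. The function name `escape_comment_text` is hypothetical and not taken from either script; it only assumes that the comment text is a plain string before it is handed to lxml.

```python
import re


def escape_comment_text(text):
    """Make a string safe to use as XML comment content (hypothetical helper)."""
    # Collapse every run of two or more hyphens into a single hyphen,
    # so no '--' can survive regardless of the run's length.
    text = re.sub(r"-{2,}", "-", text)
    # A comment must not end with '-' either, because the closing
    # delimiter '-->' would then form '--'.
    if text.endswith("-"):
        text += " "
    return text


print(escape_comment_text("a ------ b"))  # prints "a - b"
```

If '&ndash;' is preferred as the replacement, the same pattern could be used with `re.sub(r"-{2,}", "&ndash;", text)`. Note that the entity would show up literally, since comment content is not parsed for entity references.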