Closed kosloot closed 3 years ago
Given this FoLiA:
<?xml version="1.0" encoding="UTF-8"?>
<FoLiA xmlns:xlink="http://www.w3.org/1999/xlink" xmlns="http://ilk.uvt.nl/folia" xml:id="hbr" generator="libfolia-v2.8" version="2.4.0">
  <metadata type="native">
    <annotations>
      <paragraph-annotation/>
      <text-annotation set="https://raw.githubusercontent.com/proycon/folia/master/setdefinitions/text.foliaset.ttl"/>
      <part-annotation/>
      <hyphenation-annotation/>
    </annotations>
  </metadata>
  <text xml:id="hbr.text">
    <p xml:id="hbr.text.p">
      <part xml:id="hbr.text.part.1" space="no">
        <t>White<t-hbr/>water Moun<t-hbr/></t>
      </part>
      <part xml:id="hbr.text.part.2">
        <t>tains.</t>
      </part>
    </p>
  </text>
</FoLiA>
the Python function folia2txt (rightfully) extracts the text:
Whitewater Mountains.
But its C++ counterpart FoLiA-2text extracts:
Whitewater Moun tains.
ignoring the space="no" on the first part. This is most probably a bug in libfolia.
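For illustration, the expected joining can be sketched with only the Python standard library. This is not the code of folia2txt or libfolia; it is a simplified sketch that assumes a <t-hbr/> contributes no text and that a part with space="no" is followed by no separator. The helper names part_text and paragraph_text are hypothetical.

```python
import xml.etree.ElementTree as ET

NS = "{http://ilk.uvt.nl/folia}"  # FoLiA namespace, as declared in the document above

# Reduced copy of the paragraph from the example document.
DOC = """<p xmlns="http://ilk.uvt.nl/folia">
  <part space="no"><t>White<t-hbr/>water Moun<t-hbr/></t></part>
  <part><t>tains.</t></part>
</p>"""

def part_text(part):
    """Text of a part's <t>; <t-hbr/> is empty, so it adds nothing."""
    t = part.find(NS + "t")
    return "".join(t.itertext())

def paragraph_text(p):
    """Join the parts, adding a space after a part unless space="no"."""
    pieces = []
    for part in p.findall(NS + "part"):
        pieces.append(part_text(part))
        if part.get("space", "yes") != "no":
            pieces.append(" ")
    return "".join(pieces).strip()

text = paragraph_text(ET.fromstring(DOC))
print(text)
```

Under these assumptions the two parts are joined without a space, which matches the output reported for folia2txt.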
Seems fixed in libfolia now.