request4.txt out4.txt html-in.txt
The attached request with the attached HTML input leads to segmentation issues. E.g. "Yves Desmet" is correctly recognized, but the output duplicates the last character "t" after the closing tag:
<span
its-ta-ident-ref="http://nl.dbpedia.org/resource/Yves_Desmet"
its-ta-class-ref="http://nerd.eurecom.fr/ontology#Person"
its-ta-confidence="0.9869452659937508">Yves Desmet</span>t
This seems to be a recurring error; see the attached output (out4.txt) for details.
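
To get a rough idea of how often this happens in the output, a small check like the following might help. It is only a sketch, not part of the service: the function name `find_duplicated_tails` and the regular expression are my own assumptions, and it simply looks for annotated spans whose last character is repeated right after `</span>`. It is a heuristic, so a word that legitimately continues with the same letter would also be reported.

```python
import re

# Hypothetical helper for inspecting the output; not part of the service.
# Matches an annotated <span ...>text</span> followed by one more character.
SPAN_RE = re.compile(r"<span\b[^>]*>([^<]+)</span>(.)", re.DOTALL)


def find_duplicated_tails(html):
    """Return span texts whose last character is repeated right after </span>."""
    return [text for text, nxt in SPAN_RE.findall(html) if text[-1] == nxt]
```

Running it over the contents of out4.txt would list every annotated entity that shows the same symptom as "Yves Desmet".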