LanguageMachines / libfolia

FoLiA library for C++
https://proycon.github.io/folia
GNU General Public License v3.0

Delimiter problem in corrected text #38

Closed: kosloot closed this issue 4 years ago

kosloot commented 4 years ago

Given this example:

<?xml version="1.0" encoding="UTF-8"?>
<FoLiA xmlns:xlink="http://www.w3.org/1999/xlink" xmlns="http://ilk.uvt.nl/folia" xml:id="doc" generator="libfolia-v1.11" version="2.2">
  <metadata type="native">
    <annotations>
      <correction-annotation />
      <text-annotation />
      <sentence-annotation />
      <token-annotation />
    </annotations>
  </metadata>
  <text xml:id="bug">
    <s xml:id="s.1">
      <t class="in">Dit is een test</t>
      <w xml:id="w.1">
        <t class="in">Dit</t>
      </w>
      <w xml:id="w.2">
        <t class="in">is</t>
      </w>
      <correction>
        <original>
          <w xml:id="w.3">
            <t class="in">een</t>
          </w>
        </original>
      </correction>
      <w xml:id="w.4">
        <t class="in">test</t>
      </w>
    </s>
  </text>
</FoLiA>

folialint gives this erroneous message:

tests/textproblem_2.xml failed: inconsistent text: node s(s.1) has a mismatch for the text in set:in
the element text ='Dit is een test'
 the deeper text ='Dit is eentest'

Apparently, the delimiter is lost somewhere: the space between 'een' (the word inside the correction) and 'test' is missing from the deeper text.
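To make the expected result concrete, here is a minimal sketch of the consistency check in Python, using only the standard library. It is not libfolia's implementation: the helper names `element_text` and `deeper_text` are hypothetical, the sentence is a reduced copy of the example above, and the sketch assumes that every `<w>` below the sentence (including the one inside `<correction>`/`<original>`) contributes its text, joined by a single space.

```python
import xml.etree.ElementTree as ET

FOLIA_NS = "http://ilk.uvt.nl/folia"
NS = {"f": FOLIA_NS}

# The sentence from the report, reduced to the parts that carry text.
SENTENCE_XML = """\
<s xmlns="http://ilk.uvt.nl/folia" xml:id="s.1">
  <t class="in">Dit is een test</t>
  <w xml:id="w.1"><t class="in">Dit</t></w>
  <w xml:id="w.2"><t class="in">is</t></w>
  <correction>
    <original>
      <w xml:id="w.3"><t class="in">een</t></w>
    </original>
  </correction>
  <w xml:id="w.4"><t class="in">test</t></w>
</s>"""


def element_text(sentence, textclass="in"):
    """Text stated directly on the sentence (its own <t> child)."""
    t = sentence.find(f"f:t[@class='{textclass}']", NS)
    return t.text if t is not None else None


def deeper_text(sentence, textclass="in", delimiter=" "):
    """Join the text of every <w> below the sentence, in document order.

    Assumption for this sketch: words nested inside <correction> count
    like any other word and are followed by the same delimiter.
    """
    parts = []
    for word in sentence.iter(f"{{{FOLIA_NS}}}w"):
        t = word.find(f"f:t[@class='{textclass}']", NS)
        if t is not None and t.text:
            parts.append(t.text)
    return delimiter.join(parts)


sentence = ET.fromstring(SENTENCE_XML)
print(repr(element_text(sentence)))  # 'Dit is een test'
print(repr(deeper_text(sentence)))   # 'Dit is een test'
```

Under these assumptions both texts are identical, so the document should validate. The reported 'Dit is eentest' suggests that the delimiter after w.3 is dropped when the word is reached through the correction.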

kosloot commented 4 years ago

I assume the solution is correct.