clarin-eric / ParlaMint

ParlaMint: Comparable Parliamentary Corpora
https://clarin-eric.github.io/ParlaMint/

SE: missing join="right" inside multi-token named entities #808

Open matyaskopp opened 11 months ago

matyaskopp commented 11 months ago
In the SE corpus, the last token of a multi-token named entity is not marked with join="right" when punctuation follows it directly. The same segment is shown below, first as plain text and then with its annotation:
<seg xml:id="i-GrnSWBPrj3tLpQJhZQPn1x"><!--
SKIPPING 
--> Magnus Persson.<!--
SKIPPING 
--></seg>
<seg xml:id="i-GrnSWBPrj3tLpQJhZQPn1x"><!--
SKIPPING 
-->
  <s xml:id="i-N3LX56TTvjYu7TUFndDTwa">
<!--
SKIPPING 
-->
    <name type="PER">
        <w msd="UPosTag=PROPN|Case=Nom" lemma="Magnus" xml:id="i-N1h2fYAYMwEYCwNcZFD5b2">Magnus</w>
        <w msd="UPosTag=PROPN|Case=Nom" lemma="Persson" xml:id="i-N1h2osFzjpkoYDuEsf4zT6">Persson</w> <!-- missing: join="right" -->
    </name>
    <pc msd="UPosTag=PUNCT" xml:id="i-N1h2y2fPxJRtodt8RWWtoN">.</pc>
<!--
SKIPPING 
-->
  </s>
<!--
SKIPPING 
-->
</seg>
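
A minimal sketch of how such cases could be detected automatically, assuming the plain text of a sentence is available alongside its annotation. It is not part of the ParlaMint tooling: the helper name `missing_joins` and the short ids `w1`, `w2`, `pc1` are hypothetical, and the sketch assumes flat tokens (no nested `<w>` elements) whose text occurs in the plain text in order.

```python
import xml.etree.ElementTree as ET

# Simplified copy of the annotated sentence above (hypothetical short ids).
SAMPLE = """<s xml:id="s1">
  <name type="PER">
    <w xml:id="w1">Magnus</w>
    <w xml:id="w2">Persson</w>
  </name>
  <pc xml:id="pc1">.</pc>
</s>"""

# ElementTree expands the built-in xml: prefix to this namespace.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def local(tag):
    """Strip a '{namespace}' prefix from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def missing_joins(sentence, raw_text):
    """Return ids of tokens glued to the next token in raw_text
    but lacking join="right" in the annotation."""
    # iter() walks in document order, so tokens inside <name> are included.
    tokens = [el for el in sentence.iter() if local(el.tag) in ("w", "pc")]
    missing = []
    pos = 0
    for tok, nxt in zip(tokens, tokens[1:]):
        start = raw_text.index(tok.text, pos)
        pos = start + len(tok.text)
        # The tokens are glued if the next one starts right where this one ends.
        glued = raw_text.startswith(nxt.text, pos)
        if glued and tok.get("join") != "right":
            missing.append(tok.get(XML_ID))
    return missing


print(missing_joins(ET.fromstring(SAMPLE), "Magnus Persson."))
```

On the sample, the check reports `w2` ("Persson"), the token that should carry join="right" because the full stop follows it without a space.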


Single-token entities work correctly.