calzada / PARLAMINT-ES-MC

2 stars 4 forks source link

transcriber notes: spaces and full stop #42

Closed matyaskopp closed 1 year ago

matyaskopp commented 1 year ago

There are two issues that make notes strange:

  1. source wrongly mark full stop after the note as a part of the text (not note)

https://www.congreso.es/public_oficiales/L14/CONG/DS/PL/DSCD-14-PL-248.PDF image https://github.com/calzada/PARLAMINT-ES-MC/blob/010ef3c20813a1d733516544502021e218449b83/CD/CD230223.xml#L229-L232

  1. conversion removes space before note and adds space after note:

https://github.com/calzada/PARLAMINT-ES-MC/blob/010ef3c20813a1d733516544502021e218449b83/ParlaMint-ES.sample.TEI/ParlaMint-ES_2023-02-23-CD230223.xml#L156-L165

The best solution is probably to remove the full stop after the note if ! / . / ? is before note. This would cause (in most cases) moving notes outside the paragraph

matyaskopp commented 1 year ago

space before a note is removed with parlamint2root.xsl

matyaskopp commented 1 year ago

space after a note is added with notefixin-scripts

matyaskopp commented 1 year ago

fixed