tallforasmurf / PPQT

A post-processing tool for PGDP written in Python, PyQt4, and Qt
GNU General Public License v3.0
4 stars 2 forks source link

page markers end up in the wrong spot #163

Open bibimbop opened 10 years ago

bibimbop commented 10 years ago

If a page marker happens between paragraphs, inserting the html page number ends up in the previous paragraph.

For instance:

vieil adage: <i>Un gentilhomme en vaut
un autre.</i>
<span class='pagenum'><a id='Page_398' name='Page_398'>[398]</a></span>
</p>

<p>
<i>Interrogatoire et réponse de M. de Thou
à Monseigneur le Cardinal-Duc, qui

instead of the desired result:

vieil adage: <i>Un gentilhomme en vaut
un autre.</i>
</p>

<p>
<span class='pagenum'><a id='Page_398' name='Page_398'>[398]</a></span>
<i>Interrogatoire et réponse de M. de Thou
à Monseigneur le Cardinal-Duc, qui
tallforasmurf commented 10 years ago

Not sure what to do about this. It's a long-standing problem, I think with GG as well. I put a note about it in the "suggested workflow" document, step 25, "After inserting the text, do the following searches to correct bad folio placement..."

Fixing would mean for pqPages.insertMarkers to do some parsing of the surrounding text. Not sure if this can be done reliably for every case. I'll leave this issue open for now.