plazi / GoldenGATE-Imagine

A GUI Tool For Freeing Text and Data from PDF Documents
Other
5 stars 0 forks source link

Text flow problems #48

Open millerjeremya opened 1 year ago

millerjeremya commented 1 year ago

@myrmoteras @FLSimoes In this treatment https://treatment.plazi.org/id/03C8879D-FF9F-D26E-82F4-FCCC0FE7F945 There is a word flow problem between the first and third pages. The issue appears to be Figure 1, on the page between, which is rotated and the caption text flow is wrong because of this. Can someone help me fix this? Here is a screen shot showing the problematic text in TreatmentBank image

gsautter commented 1 year ago

The main text stream seems to be connected alright between pages 1 and 3, but there sure was an issue with the caption on page 2, which was only partially marked as such, and partially as plain text. To resolve such issues, "Edit Page in Sub-Window" comes in handy, especially as it lets you rotate the page by 90°, 180°, or 270° if required. A few other issues were with materialsCitations spanning across subSubSection and paragraph boundaries ... adjusted the latter to accommodate the former.

flsimoes commented 1 year ago

Let me know if all is sorted or if I need to do anything else

gsautter commented 1 year ago

I think all the text flow problems are sorted, as they actually were annotation nesting problems ... as to the rest, I'm not really sure, but the QC protocol looked OK.