gsautter / goldengate-imagine

Automatically exported from code.google.com/p/goldengate-imagine

decoding issues: word flow, block: kopfweh corpus: Ferrari_oral triptans_meta-analysis_Lancet 2001 #304

Open myrmoteras opened 7 years ago

myrmoteras commented 7 years ago

image

Ferrari_oral triptans_meta-analysis_Lancet 2001.pdf

gsautter commented 7 years ago

Looks like the horizontal line right below the references is too close to the words and thus ends up obstructing the column gap ... might be a hard one to catch, as similar lines in other circumstances intentionally hold blocks together, e.g. in tables. I'll see what I can do.

Anyway, "Split Block" should be able to swiftly fix this.
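To illustrate the effect described above, here is a minimal sketch in Python. It is not GoldenGATE Imagine's actual block detection, just a generic vertical-projection approach. The names `find_column_gap` and `is_thin_rule`, the box coordinates, and the thresholds are all hypothetical. It shows how a horizontal rule that reaches across the gutter can hide the column gap, and how ignoring very flat boxes brings the gap back. As noted above, such a filter is only a heuristic, since similar lines legitimately hold blocks together in tables.

```python
from typing import List, Optional, Tuple

# A box is (left, top, right, bottom) in page pixels; all values are made up.
Box = Tuple[int, int, int, int]


def find_column_gap(boxes: List[Box], page_width: int,
                    min_gap: int = 10) -> Optional[Tuple[int, int]]:
    """Return (start, end) of the widest interior empty vertical strip, or None."""
    # Project every box onto the x-axis and mark the columns it covers.
    occupied = [False] * page_width
    for left, _top, right, _bottom in boxes:
        for x in range(max(0, left), min(page_width, right)):
            occupied[x] = True

    best = None
    run_start = None
    for x in range(page_width + 1):
        free = x < page_width and not occupied[x]
        if free and run_start is None:
            run_start = x
        elif not free and run_start is not None:
            # Skip the page margins: runs touching the left or right page edge.
            interior = run_start > 0 and x < page_width
            width = x - run_start
            if interior and width >= min_gap:
                if best is None or width > best[1] - best[0]:
                    best = (run_start, x)
            run_start = None
    return best


def is_thin_rule(box: Box, max_height: int = 2) -> bool:
    """Heuristic (an assumption): a very flat box is a graphical line, not text."""
    _left, top, _right, bottom = box
    return bottom - top <= max_height


# Two text columns plus one horizontal rule that reaches across the gutter.
words = [(50, 100, 280, 112), (50, 120, 280, 132),
         (320, 100, 550, 112), (320, 120, 550, 132)]
rule = (50, 140, 400, 141)

gap_with_rule = find_column_gap(words + [rule], page_width=600)
gap_filtered = find_column_gap(
    [b for b in words + [rule] if not is_thin_rule(b)], page_width=600)
```

With the rule included, the projection has no interior gap and the two columns merge into one block. With thin boxes filtered out, the gutter between the columns is found again.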

myrmoteras commented 7 years ago

the problem is the 3rd line to the left that extends too far right

myrmoteras commented 7 years ago

in the original, the words have the same distance and are not spread over the entire line

gsautter commented 7 years ago

The problem is in the PDF proper ... observe the selection highlights (taken in Acrobat) ... these highlights are where the word bounding boxes are: image

gsautter commented 7 years ago

I'll have to investigate how Acrobat manages to render the words in the appropriate positions even though their technical positions (which correspond to the selection highlights) are misplaced.