cisocrgroup / PoCoTo

The CIS OCR PostCorrectionTool
Other
40 stars 4 forks source link

Cannot export due to missing image alignment #8

Open sarschu opened 7 years ago

sarschu commented 7 years ago

I have a 104 pages long PDF. We corrected it inside of PoCoTo. When I wanted to export to txt it turned out that one of the pages is corrupted. Page 28 misses the intext alignment of the image. The image, however, is there. It will also not be shown in the lower part of PoCoTo, the page count will not go up. However, when I click on one of the words, the image appears in the lower part and the word is marked. The inline image alignment never appears. Is there a way to fix that or at least to extract the corrected text from the db file?

finkf commented 7 years ago

Can you try to export the project again as plain text and send me pocoto's logfile (finfk@cis.lmu.de) so I can have a closer look on the problem?