Jochre has a feature of sort of "unword-wrapping" text. The feature helps assure that the resulting text, when subject to word wrapping by a typical modern word processor, will appear properly formatted, with paragraph boundaries where they should be and linebreaks based on line width limits where they should appear.
While this is an impressive and welcome feature in many cases, it can be undesirable in certain cases:
(1) this makes comparison with the original more difficult. When humans put OCR output into standard word processors, the loss of the linebreaks based on line width means that you cannot easily visually compare lines in the source image with lines in the output.
(2) In poems each line should normally be preserved.
Can there be an option to preserve all line breaks? This is a feature request.
Jochre has a feature of sort of "unword-wrapping" text. The feature helps assure that the resulting text, when subject to word wrapping by a typical modern word processor, will appear properly formatted, with paragraph boundaries where they should be and linebreaks based on line width limits where they should appear.
While this is an impressive and welcome feature in many cases, it can be undesirable in certain cases:
(1) this makes comparison with the original more difficult. When humans put OCR output into standard word processors, the loss of the linebreaks based on line width means that you cannot easily visually compare lines in the source image with lines in the output.
(2) In poems each line should normally be preserved.
Can there be an option to preserve all line breaks? This is a feature request.