Closed GoogleCodeExporter closed 9 years ago
Problem is that your input files use windows end-of-line (\r\n) and tesseract
expect unix like end-of-line (\n). While it is not and problem within text
(4.txt, 5.txt) it cause problem where first line is empty line (files 3.txt,
2.txt).
So you can:
1. remove first empty line
2. use unix like end-of-line (recommended)
You can use util dos2unix to convert end of line or some advanced editors (e.g.
Notepad++ on windows) for this task.
Original comment by zde...@gmail.com
on 1 May 2015 at 12:56
Original issue reported on code.google.com by
adityaku...@gmail.com
on 8 Dec 2014 at 2:54Attachments: