Open GoogleCodeExporter opened 9 years ago
please provide test case.
Original comment by zde...@gmail.com
on 7 Feb 2015 at 7:17
now=$(date +"%y%m%d-%H%M");
LANG=hin
cd testing
for f in page-019.tif
do
echo "OCR at $(date) with -l $LANG for $f file , please wait..."
tesseract --tessdata-dir C:/Home/UserShree/tesseract-ocr/testing $f $f-$LANG
-l $LANG -psm 6 pdf
LANG=san
tesseract --tessdata-dir C:/Home/UserShree/tesseract-ocr/testing $f $f-$LANG
-l $LANG -psm 6 pdf
done
------------
The txt and pdf output as well as the input file are attached. When I copy the
text from pdf, it is formatted differently than the txt files.
Original comment by shreeshrii
on 17 Feb 2015 at 1:30
Attachments:
Original issue reported on code.google.com by
shreeshrii
on 31 Oct 2014 at 4:12