Closed Shreeshrii closed 6 years ago
Getting errors while creating box file for Devanagari script.
Attaching zip file with tif, gt.txt and generated box file.
I had generated groundtruth files using tesseract, which added a FF to the OCRed text file. That was the cause of the error.
Changed the command to following to get rid of problem.
tesseract --tessdata-dir ../tessdata "${img_file}" "${img_file%.*}-gt" --psm 6 --oem 1 -l san -c page_separator=''