Chenhx / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr

How can we get the position (coordinates) of each word? #277

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Hi all,

    Could you please tell us how to get the position (coordinates) of 
each word? We need it to locate the specific words we are looking for.
    Tesseract only outputs a .txt file, which does not contain any 
coordinate information. Is it in the .raw file, the .map file, or somewhere else?

    The "Design Documents" page says: "Edit ccmain/output.cpp and 
change the FALSE following tessedit_write_raw_output to TRUE." We found 
the following in output.cpp:

tessedit_write_raw_output, FALSE,
tessedit_write_output, FALSE,
tessedit_write_ratings, FALSE,
tessedit_write_txt_map, FALSE,
....

    So we changed FALSE to TRUE and rebuilt in VC++, but the .raw 
and .map files did not appear after we ran tesseract.exe from the command line. 

    Why is that? 
    Thank you very much!

Original issue reported on code.google.com by zor...@163.com on 25 Jan 2010 at 5:44
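
A note for later readers, not part of the original report: as far as I know, Tesseract releases from 3.00 onward can write hOCR output (for example `tesseract image.png out hocr`), in which each word is a span carrying a `bbox x0 y0 x1 y1` entry in its title attribute. The C++ sketch below pulls word boxes out of such markup. The `WordBox` struct and the `parse_hocr_words` function are hypothetical names made up for illustration, the attribute order it expects (class before title) is an assumption about typical output, and regex matching is a shortcut here rather than a real HTML parser.

```cpp
#include <iostream>
#include <regex>
#include <string>
#include <vector>

// One recognized word and its bounding box in image pixels.
// hOCR boxes are given as left, top, right, bottom with the origin at top-left.
struct WordBox {
    std::string text;
    int left, top, right, bottom;
};

// Hypothetical helper: collect every ocrx_word span found in an hOCR string.
// Assumes the class attribute precedes the title attribute and that the
// title starts with "bbox x0 y0 x1 y1".
std::vector<WordBox> parse_hocr_words(const std::string& hocr) {
    static const std::regex word_re(
        R"re(<span[^>]*class=['"]ocrx_word['"][^>]*title=['"]bbox (\d+) (\d+) (\d+) (\d+)[^'"]*['"][^>]*>(.*?)</span>)re");
    static const std::regex tag_re("<[^>]+>");

    std::vector<WordBox> words;
    for (std::sregex_iterator it(hocr.begin(), hocr.end(), word_re), end;
         it != end; ++it) {
        const std::smatch& m = *it;
        WordBox box;
        // Drop nested markup such as <strong> or <em> around the word text.
        box.text = std::regex_replace(m[5].str(), tag_re, "");
        box.left = std::stoi(m[1].str());
        box.top = std::stoi(m[2].str());
        box.right = std::stoi(m[3].str());
        box.bottom = std::stoi(m[4].str());
        words.push_back(box);
    }
    return words;
}

// Print one line per word: text followed by its four coordinates.
void print_word_boxes(const std::vector<WordBox>& words) {
    for (const WordBox& w : words) {
        std::cout << w.text << " " << w.left << " " << w.top << " "
                  << w.right << " " << w.bottom << "\n";
    }
}
```

To use it, read the hOCR file into a `std::string`, pass it to `parse_hocr_words`, and search the returned vector for the word you want; its four integers give the box in pixel coordinates.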

GoogleCodeExporter commented 9 years ago

Original comment by theraysm...@gmail.com on 20 May 2010 at 3:48