AmitGorvadiya / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

tesseract exit when treating multiple images with the baseapi #207

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Operating System: Ubuntu 8.04

We use tesseract baseapi to treat 6000 regions of 70 different images in 
one process.

When we reach about 6000 calls to TessBaseAPI::TesseractRect, 
tesseract fail to treat the image and exit.

Before each call we init tesseract with TessBaseAPI::SimpleInit(dataPath, 
lang, 0).

After each call we end with TessBaseAPI::End()

Is it a none recoverable error?
Is there anyway to catch the error and prevent the program to exit?

Here is the error message:

error: Could not find the edge to remove, 36233 char = '0 : next = 20849, 
char = 'C', FORWARD         
1 : next = 36876, char = 'c', FORWARD         
2 : next = 37052, char = 'B', FORWARD         
3 : next = 22712, char = 'b', FORWARD         
4 : next = 37651, char = 'A', FORWARD         
5 : next = 4488, char = 'a', FORWARD LAST EOW
6 : next = 15341, char = 's',              EOW
7 : next = 36237, char = 's',              EOW
8 : next = 13011, char = 'l',              EOW
9 : next = 37083, char = 'n',              EOW
10 : next = 49647, char = 'l',              EOW
11 : next = 35215, char = 's',              EOW
12 : next = 44531, char = 'd',              EOW
13 : next = 8303, char = 'g',              EOW
14 : next = 21121, char = 'n',              EOW
15 : next = 25888, char = 's',              EOW
16 : next = 16336, char = 'a',              EOW
17 : next = 21795, char = 'f',              EOW
18 : next = 38072, char = 'l',              EOW
19 : next = 15386, char = 'n',              EOW
20 : next = 49571, char = 's',              EOW
21 : next = 38451, char = 's',              EOW
22 : next = 11561, char = 'm',              EOW
23 : next = 32280, char = 'a',              EOW
24 : next = 22712, char = 's',              EOW
25 : next = 31803, char = 'e',              EOW
26 : next = 45562, char = 's',              EOW
27 : next = 38386, char = 'g',              EOW
28 : next = 6686, char = 'y',              EOW
29 : next = 49289, char = 'd',              EOW
30 : next = 37518, char = 'd',              EOW
31 : next = 19008, char = 'e',              EOW
32 : next = 23293, char = 's',              EOW
33 : next = 46362, char = 's',              EOW
34 : next = 40498, char = 'y',              EOW
35 : next = 7734, char = 's',              EOW
36 : next = 14071, char = 's',              EOW
37 : next = 49339, char = 'y',              EOW
38 : next = 10702, char = 's',              EOW
39 : next = 6474, char = 'a',              EOW
40 : next = 11183, char = 's',              EOW
41 : next = 38564, char = 't',              EOW
42 : next = 48406, char = 't',              EOW
43 : next = 4488, char = 'z',              EOW
44 : next = 44669, char = 'g',              EOW
45 : next = 13001, char = 's',              EOW
46 : next = 8820, char = 's',              EOW
47 : next = 20577, char = 's',              EOW
48 : next = 18033, char = 's',              EOW
49 : next = 24865, char = 's',              EOW
50 : next = 41490, char = 's',              EOW
51 : next = 32397, char = 'r',              EOW
52 : next = 35508, char = 's',              EOW
53 : next = 8648, char = 's',              EOW
54 : next = 44825, char = 's',              EOW
55 : next = 42319, char = 's',              EOW
56 : next = 35084, char = 's',              EOW
57 : next = 10537, char = 's',              EOW
58 : next = 28156, char = 's',              EOW
59 : next = 10500, char = 's',              EOW
60 : next = 21202, char = 's',              EOW
61 : next = 10412, char = 's',              EOW
62 : next = 4090, char = 's',              EOW
63 : next = 10265, char = 's',              EOW
64 : next = 22809, char = 'a',              EOW
65 : next = 22268, char = 'e',              EOW
66 : next = 29459, char = 'd',              EOW
67 : next = 4467, char = 'a',              EOW
68 : next = 35597, char = 'n',              EOW
69 : next = 47870, char = 's',              EOW
70 : next = 43296, char = 's',              EOW
71 : next = 21468, char = 's',              EOW
72 : next = 7773, char = 's',              EOW
73 : next = 42724, char = 'n',              EOW
74 : next = 37686, char = 's',              EOW
75 : next = 30014, char = 's',              EOW
76 : next = 19389, char = 'm',         LAST EOW

Please use labels and text to provide additional information.

Original issue reported on code.google.com by alexnanc...@gmail.com on 19 May 2009 at 10:06

GoogleCodeExporter commented 9 years ago
i have the same problem but whe i used french language instead of english this
doesn't appear

Original comment by lancereau.flavie@gmail.com on 4 Nov 2009 at 4:26

GoogleCodeExporter commented 9 years ago
We solve the problem by initiating Tesseract once per image despite the fact 
that
we process multiple regions in the image.

So, we call TessBaseAPI::SimpleInit(dataPath, lang, 0) before processing the 
image and
TessBaseAPI::End() when we have finished processing all the image regions.

Original comment by alexnanc...@gmail.com on 4 Nov 2009 at 4:33

GoogleCodeExporter commented 9 years ago
it seems to works thx =)

Original comment by lancereau.flavie@gmail.com on 4 Nov 2009 at 4:46

GoogleCodeExporter commented 9 years ago
Fixed in 3.00.

Original comment by theraysm...@gmail.com on 19 May 2010 at 11:02