Closed GoogleCodeExporter closed 9 years ago
[deleted comment]
pepeyola,
This is VERY interesting. Can you explain "with a different image proccessor".
What's the difference between this and the tess engine? Are you saying they
glued
together there own (or another) IP & the tess engine (whatever that is)?
Where did you learn about this? Maybe the hacker's guide?
KB
Original comment by beaumon...@gmail.com
on 15 Aug 2007 at 9:17
beaumont.k
You can see the details of WeOCR server in
http://asv.aso.ecei.tohoku.ac.jp/tesseract.
I think they use its own Image Processor together with the character
recognition
engine of Tesseract.
For more details, you can ask to the author of the project, Professor Hideaki
GOTO
(http://www.sc.isc.tohoku.ac.jp/~hgot/)
Original comment by pepey...@gmail.com
on 15 Aug 2007 at 11:05
The image processing that goes before an OCR engine is always going to be
critical to
its accuracy. The thresholding algorithm in tesseract 2.00 is very basic. It
was the
best published algorithm out of those that I tested in the mid 1990s (See
http://www.hpl.hp.com/techreports/93/HPL-93-22.pdf for more information)
Unfortunately, the adaptive thresholding algorithm that was developed alongside
tesseract, which was significantly better, was not part of the open source
release,
due to its commercial utility. There could easily be other open source or
published
algorithms available by now, and some day one of these may find its way into
tesseract.
Original comment by theraysm...@gmail.com
on 17 Aug 2007 at 8:58
Issue 172 has been merged into this issue.
Original comment by theraysm...@gmail.com
on 30 Dec 2008 at 4:29
Fixed in 3.01
Original comment by theraysm...@gmail.com
on 20 May 2010 at 6:55
Is this fix available in the public svn? I checked out the svn today but
couldn't find anything, and issue 172 still outputs "white" for me.
Original comment by iainmel...@gmail.com
on 11 Jul 2010 at 10:53
I believe Ray intended to close this one.
Original comment by joregan
on 23 Feb 2012 at 11:11
Original issue reported on code.google.com by
pepey...@gmail.com
on 11 Aug 2007 at 6:45Attachments: