TTesseractOCR4 is a Object Pascal binding for tesseract-ocr 4.x - an optical character recognition engine.
Examples were tested in Delphi 10.2.3 (32-bit build for Windows) and Lazarus 1.8 (32-bit build for Windows and Linux in Ubuntu 18.04).
lib\tesseractocr-master.zip
. Unpack and copy all DLL files to bin\
.sudo apt install tesseract-ocr
.{$DEFINE USE_CPPAN_BINARIES}
accordingly in tesseractocr.consts.pas
if using Tesseract libraries built with CPPAN (defined as default).bin\tessdata
.eng.traineddata
).examples\delphi-console-pdfconvert
example requires osd.traineddata
and pdf.ttf
files.Open and compile example project:
examples\delphi-console-simple
. Recognize text in samples\eng-text.png
and write to console output
examples\delphi-vcl-image
4 tabs:
examples\delphi-console-pdfconvert
. Convert samples\multi-page.tif
(multiple page image file) to a PDF file
examples\lazarus-console-simple
. examples\delphi-console-simple
for Lazarus
MIT