openpaperwork / pyocr

A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
https://gitlab.gnome.org/World/OpenPaperwork/pyocr

Extract Individual Characters #87

Open ZoranPavlovic opened 6 years ago

ZoranPavlovic commented 6 years ago

Feature request: expose individual characters from the OCR results. I had a quick look, and the underlying iterators provided by the libtesseract API do appear to allow individual characters to be enumerated.

See here:

And here:
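
For illustration only, a rough sketch of what symbol-level enumeration through the libtesseract C API might look like from Python via `ctypes`. This is not pyocr code: `iter_symbols` is a hypothetical helper, and it assumes that `lib` is a `ctypes` handle to libtesseract whose pointer-returning functions have been declared with `restype = ctypes.c_void_p`, and that `result_iter` came from `TessBaseAPIGetIterator` after a successful `TessBaseAPIRecognize`.

```python
import ctypes

# tesseract::PageIteratorLevel: BLOCK=0, PARA=1, TEXTLINE=2, WORD=3, SYMBOL=4
RIL_SYMBOL = 4


def iter_symbols(lib, result_iter):
    """Yield (text, (left, top, right, bottom)) for each recognized symbol.

    Hypothetical helper. Assumes `lib` exposes the libtesseract C API and
    that pointer-returning functions use restype c_void_p, so pointers are
    not truncated on 64-bit platforms.
    """
    if not result_iter:
        return
    # The page iterator view is needed for the bounding-box call.
    page_iter = lib.TessResultIteratorGetPageIterator(result_iter)
    while True:
        raw = lib.TessResultIteratorGetUTF8Text(result_iter, RIL_SYMBOL)
        if raw:
            # Copy the C string, then hand the buffer back to tesseract.
            text = ctypes.string_at(raw).decode("utf-8")
            lib.TessDeleteText(raw)
            left, top = ctypes.c_int(), ctypes.c_int()
            right, bottom = ctypes.c_int(), ctypes.c_int()
            lib.TessPageIteratorBoundingBox(
                page_iter, RIL_SYMBOL,
                ctypes.pointer(left), ctypes.pointer(top),
                ctypes.pointer(right), ctypes.pointer(bottom),
            )
            yield text, (left.value, top.value, right.value, bottom.value)
        # Advance one symbol; a false return means the page is exhausted.
        if not lib.TessPageIteratorNext(page_iter, RIL_SYMBOL):
            break
```

The key point is simply that the iterator level is `RIL_SYMBOL` rather than the word or line level; how this would best be surfaced through a pyocr builder is left open here.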