Closed bananarama456 closed 7 months ago
You're installing a tesserocr binary wheel that only supports PNG. I just updated the GitHub Actions workflow to build binaries with jpeg/tiff/webp support. You can try out the wheels from https://github.com/sirfz/tesserocr/actions/runs/8189850537
If you wish to build tesserocr yourself, install as follows:
pip install --no-binary tesserocr tesserocr
Of course, all requirements must already be installed.
tesserocr v2.6.3 binaries are now built with jpeg (as well as tiff and webp) support.
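If you want to check which image libraries your installed build was linked against, something like the following might help. It is only a rough sketch: it assumes the string returned by tesserocr.tesseract_version() lists libraries as "name version" pairs separated by " : ", as in the output quoted in this issue. The helper names image_libs and installed_image_libs are made up for illustration and are not part of tesserocr.

```python
import re


def image_libs(version_text):
    """Return the lowercase library names found in a version string.

    Assumption: libraries appear as "libname <version>" (or "zlib <version>"),
    as in the output quoted in this issue.
    """
    pattern = r"\b(lib[a-z0-9]+|zlib)\s+\d"
    return {name.lower() for name in re.findall(pattern, version_text, flags=re.IGNORECASE)}


def installed_image_libs():
    """Parse the version string of the locally installed tesserocr build."""
    import tesserocr  # imported lazily so image_libs() works without tesserocr installed

    return image_libs(tesserocr.tesseract_version())
```

Calling installed_image_libs() inside the container and checking whether "libjpeg" or "libtiff" is in the result should tell you whether the wheel you installed has support for those formats.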
When I run tesserocr.tesseract_version() in my container, I only get "tesseract 5.3.3\n leptonica-1.83.1\n libpng 1.6.34 : zlib 1.2.11", meaning I am unable to run tesserocr against jpeg, gif, tiff, etc. files.
My dockerfile looks like this:
I am just installing tesserocr with pip via my requirements.txt.
When I look at my container image, I see that it has downloaded libgif, libjpeg, libtiff, libopenjp2, etc., but tesserocr cannot find them when initializing the model.
Any help would be greatly appreciated. Thanks :)