Open neilkumar opened 4 days ago
@MthwRobinson This issue seems to be related to the wolfi base image. What do you think?
@neilkumar - Are you using the arm64 or amd64 image?
And actually, I didn't realize tesseract was on the wolfi package manager now. We should switch to using that instead of the APK file we built regardless.
@MthwRobinson Locally in my dev environment I'm using the arm64 builds, but for production we're using amd64.
Got it, thanks. Do you have the same issue for the amd64 image? The amd64 build runs all of our unit tests during CI and that includes partition_image.
I should be able to take a look at this before the end of the week.
I did not get to deploying to production yet, so I'm not sure on the amd64.
For my use case, I have a Dockerfile that builds on yours by adding a few internal utilities, and that's where I addressed the issue for now (by uninstalling your tesseract, installing the latest from wolfi, and then installing the language packs).
Describe the bug
The docker images are missing tesseract.
To Reproduce
I was able to trace this to the tesseract binary being missing. Running:
I upgraded the tesseract library (as well as a bunch of other outdated libraries) to the latest version. That fixed the issue of the missing tesseract binary.
Running the above code again.
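For anyone who wants to confirm whether the binary is visible inside the container, a minimal sketch along these lines may help. The helper name tesseract_status is hypothetical and not part of unstructured; it only assumes that a tesseract executable, if installed, is on PATH and accepts --version.

```python
import shutil
import subprocess


def tesseract_status(binary="tesseract"):
    """Return (path, version_line) for the binary, or (None, None) if it is not on PATH.

    Hypothetical helper for illustration; not part of unstructured.
    """
    path = shutil.which(binary)
    if path is None:
        return None, None
    result = subprocess.run([path, "--version"], capture_output=True, text=True)
    # Some builds print the version banner to stderr, so fall back to it.
    output = (result.stdout or result.stderr).strip()
    version = output.splitlines()[0] if output else None
    return path, version


if __name__ == "__main__":
    path, version = tesseract_status()
    if path is None:
        print("tesseract binary not found on PATH")
    else:
        print(f"found {path}: {version}")
```

If this reports that the binary is not found, the failure is happening before TESSDATA_PREFIX or the language files come into play.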
So the next issue was that TESSDATA_PREFIX is set to /usr/local/share/tessdata when it should be /usr/share/tessdata. I fixed that and ran it again, but got the same issue. It turns out there are no language files in the tessdata directory either.
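The two checks described above (where TESSDATA_PREFIX points, and whether any language files exist there) can be sketched roughly as follows. This is only an illustration: list_tessdata_languages is a hypothetical helper, and it assumes that TESSDATA_PREFIX names the tessdata directory itself and that language files use the .traineddata extension.

```python
import os
from pathlib import Path


def list_tessdata_languages(prefix=None):
    """List language codes found as *.traineddata files in the tessdata directory.

    Hypothetical helper for illustration. Falls back to the TESSDATA_PREFIX
    environment variable when no directory is passed. Returns an empty list
    if the variable is unset or the directory does not exist.
    """
    prefix = prefix or os.environ.get("TESSDATA_PREFIX")
    if not prefix:
        return []
    directory = Path(prefix)
    if not directory.is_dir():
        return []
    # "eng.traineddata" -> "eng"
    return sorted(p.stem for p in directory.glob("*.traineddata"))


if __name__ == "__main__":
    print("TESSDATA_PREFIX =", os.environ.get("TESSDATA_PREFIX"))
    print("languages:", list_tessdata_languages())
```

An empty list covers both symptoms reported here: a prefix that points at a directory that does not exist, and a directory with no language files in it.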
Running
Fixed it, and then partition_image executed without error.
Expected behavior
That partition_image would work on the sample image in example-docs.
Additional context
Add any other context about the problem here.