openpaperwork / pyocr

A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
https://gitlab.gnome.org/World/OpenPaperwork/pyocr
930 stars 152 forks source link

Ftr minimal digit #49

Closed bnguyenvanyen closed 7 years ago

bnguyenvanyen commented 7 years ago

So this has got a number of changes :

On my system (libtesseract3.4.1) most of the tests pass except :

jflesch commented 7 years ago

Wow, nice ! There is obviously a lot of work here :) However, it's going to take me a while to review it. I'll add comments as I read the code. Please do not update the code until I'm finished (following the discussions would be a mess otherwise).

edit: oh actually, I didn't notice that Github added a button "finish review" to not publish every comment immediately :)

bnguyenvanyen commented 7 years ago

Hi, thank you for your reactivity ! I put in the minor tweaks.

I'm now working on a way to make the builder system more flexible (I rewrote libtesseract.image_to_string) but it would break any code inheriting from the existing builders, so it will probably have to stay on my fork.

If some day you're cleaning up the builders and breaking backward-compatibility anyway you might think about it !

Cheers

jflesch commented 7 years ago

My bad, I totally forgot to review this pull request, sorry :(

Anyway, it's perfect, I'm merging it. Thank you very much for this contribution :)

jflesch commented 7 years ago

It will be included in 0.4.3. I'll release today (if the tests are fine)