Open wendlerc opened 1 year ago
+1
Hi, @wendlerc and @Mohammed20201991. Any recommendations for a text detector (for handwritten texts on images) to combine with TrOCR? Your insights would help immensely.
In this repo here: https://github.com/LAION-AI/OCR-ensemble we mainly used the one from paddleocr. We also started looking into https://github.com/open-mmlab/mmocr that seems to have a 'complementary' text detector. Complementary in the sense that it has quite different strenghts/weaknesses compared to the paddleocr detector.
Hi @bit-scientist as @wendlerc mentioned in addition to integrate with other approaches like PyLia & transkribus this might help .
In this repo here: https://github.com/LAION-AI/OCR-ensemble we mainly used the one from paddleocr. We also started looking into https://github.com/open-mmlab/mmocr that seems to have a 'complementary' text detector. Complementary in the sense that it has quite different strenghts/weaknesses compared to the paddleocr detector.
Thank you, @wendlerc. Could you share your handwritten samples for comparison. It turns out some algorithms work well only with clean background. Mine, however, has somewhat different background.
Hi @bit-scientist as @wendlerc mentioned in addition to integrate with other approaches like PyLia & transkribus this might help .
Thank you, @Mohammed20201991. I think PyLaia can be of help, but is transkribus available for free? Looks like it isn't free.
The TrOCR models do recognition. Thus, in order to apply it to arbitrary images, one needs a boundingbox detector.
Which one would you recommend?
I am currently using the one from paddleocr.
Cheers, Chris