SMI / dicompixelanon

DICOM Pixel Anonymisation
3 stars 0 forks source link

ocrengine - try different paragraph settings to reduce large rectangles #26

Open howff opened 1 year ago

howff commented 1 year ago

easyocr seems to produce large rectangles when para=True, possibly because it is joining multiple small rectangles at different orientations to produce one overly-large rectangle. For example a horizontal scale 1 2 3 4 5 and a vertical scale A B C D E might be merged.

I'm sure there were some commits to the easyocr code which changed this behaviour but can't find them now. In the meantime: https://github.com/JaidedAI/EasyOCR/issues/1111

Test out this theory and see if any improvements can be made.

Also consider not using para=True and doing it manually?