yeungchenwa / OCR-SAM

Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize, and segment text instances, with several downstream tasks, e.g., Text Removal and Text Inpainting.
533 stars, 37 forks

Difference in performance between mmocr_sam_erase.py and mmocr_sam_erase_app.py #41

Closed: haaaaak closed this issue 8 months ago

haaaaak commented 8 months ago

Hello! First of all, thank you for providing such good source code. I am interested in the text-erasing task in your project. Of the two `.py` files that erase text, mmocr_sam_erase_app.py (the Gradio app) performed much better: it removes the text cleanly, whereas the output of mmocr_sam_erase.py looks as if the shapes of the letters were changed rather than erased. I used the same dilate_iteration and img_size for both.

Is this difference expected? I look forward to hearing from you. Thank you!

haaaaak commented 8 months ago

I found the cause: the input images given to the two scripts were different sizes. Thank you!
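For anyone hitting the same mismatch, one way to rule it out is to bring every input image to the same dimensions before running either script. The snippet below is only a rough sketch of that idea. The helper name `resize_to_long_side` is hypothetical and not part of OCR-SAM, and it assumes that scaling the longer side to a fixed length while keeping the aspect ratio is acceptable for your images. It only computes the target dimensions; the actual resize would be done with whatever image library you already use.

```python
def resize_to_long_side(width: int, height: int, long_side: int) -> tuple[int, int]:
    """Return (new_width, new_height) with the longer side scaled to `long_side`.

    Hypothetical helper for illustration only; it is not part of OCR-SAM.
    The aspect ratio is preserved and each side is at least 1 pixel.
    """
    if width <= 0 or height <= 0 or long_side <= 0:
        raise ValueError("width, height and long_side must be positive")
    longest = max(width, height)
    # Scale both sides by the same factor so the image is not distorted.
    new_width = max(1, round(width * long_side / longest))
    new_height = max(1, round(height * long_side / longest))
    return new_width, new_height


# Two copies of the same picture saved at different resolutions
# end up with identical dimensions after this step.
size_a = resize_to_long_side(2048, 1024, 512)
size_b = resize_to_long_side(1024, 512, 512)
print(size_a, size_b)
```

Feeding both scripts an image resized this way makes it easier to tell whether a remaining difference comes from the scripts themselves or from the input.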

yeungchenwa commented 8 months ago

Thanks for your attention, @haaaaak!