Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
Hi @wendlerc
Sorry for the late reply. We use scene text datasets to train the text detector, thus it may not perform well on document images. Yes, this behavior is expected.
Or do I just have to configure the pipeline differently: