Closed ArsalanYounus007 closed 7 months ago
Hey @ArsalanYounus007 π
A short suggestion:
If possible take the main branch and try fast_base
or db_resnet50
(pytorch) :)
I have already tried pytorch db_resnet50, It's better but still misses words. I will try fast_base with pytorch and see the results
I have already tried pytorch db_resnet50, It's better but still misses words. I will try fast_base with pytorch and see the results
Keep in mind fast_base
is only available at main branch atm :)
Yep, It's better at detecting words However, I am back at the overlapping Bboxes from left (if it's not the first word) and right.
Bug description
Hello,
I hope you are having a good day. I am getting some problem with db_resnet_50 (TF). The output Bbox contains a lot of margin and overlaps with the boxes around it.
I was able to fix the problem by increasing the bin_thresh to 0.5 - 0.6 and applying further post processing turning it into this
That solves the Bbox overlapping issue for me but started another problem, Now words are missing.
This I was able to fix by splitting image (original images are big 2550 width x 3300 height) into 2 by finding the most empty location in the middle of image. OCRing that make the detection better. But still miss some words occassionally.
What would you recommend?? Aside from training my own detection model π (I will do it later)
Code snippet to reproduce the bug
Error traceback
NA
Environment
Conda ENV Python3.10 Windows 11
Deep Learning backend