MhLiao / TextBoxes_plusplus

TextBoxes++: A Single-Shot Oriented Scene Text Detector
Other
954 stars 279 forks source link

Low recall #113

Open Arseny-N opened 5 years ago

Arseny-N commented 5 years ago

Hello @MhLiao, I would like to thank you for the great work !

But I am experiencing problems with running the network. My results differ significantly from what others archive. The network fails to detect text in several areas leading to very low recall (0.3) which in turn leads to a low f-measure (0.5). The precision is quite high - 0.9.

I have compared my results on demo.jpg with yours an they are quite similar - https://github.com/MhLiao/TextBoxes_plusplus/issues/19

603,94,675,96,672,118,601,116,0.998044
603,96,669,97,668,117,603,116,0.937017

601,201,678,199,680,232,604,233,0.997581
600,199,680,197,682,232,602,234,0.998967

620,119,663,121,660,140,616,138,0.988785
619,118,661,119,660,139,618,138,0.956784

617,73,661,74,660,94,615,93,0.981392
619,73,661,74,659,93,617,93,0.964453

620,141,664,142,663,161,619,161,0.932351
621,141,661,141,661,160,620,160,0.903893

where the first line is yours and the second is mine. I reordered the detections.

I downloaded the icdar2015 pretrained model from dropbox and it has a sha1 sum of 4c80a5ee6c7e2eebaa200ad8b75d68a048abc6b4

I use the code from the examples/text/demo_det.py file.

Here are some failure examples, the red boxes are the ground truth ones and the green boxes are what is detected.

p1 p2 p3 p4

I also run the model on icdar focused scene text dataset the and it gives better results - F1 score: 0.708376421923 precision: 0.816448152563 recall: 0.625570776256.