Closed bado-lee closed 6 years ago
We are having the Spring Festival and have no machine to test the code now. I will check the performance after the vacation.
Thanks a lot for the reply. I'll wait for your later response. Happy Holidays.
@bado-lee The previous demo includes detection and recognition, and the "f_score_threshold" is not optimal for reproducing the detection result for IC15. To reproduce the detection results, you can use "demo_det.py", which only includes detection. (You also need to change the input scale into 1024*1024.)
@MhLiao Thanks for the reply. I've checked and tested your new code, but it's still short in 2% point.
'input_height' : 1024,
'input_width' : 1024,
'overlap_threshold' : 0.2,
'det_score_threshold' : 0.2,
I've used above configurations and the only parameter that is different from my previous testing is det_score_threashold which was 0.1
below is the result so far
Calculated!{"recall": 0.7558979297063072, "precision": 0.8532608695652174, "hmean": 0.8016339034975747, "AP": 0}
which is still 1.6% less than your Quad(Single) result. Please let me know if there are other factors that I can try. Thanks in advance.
@bado-lee I am sorry that I gave a wrong model which achieves lower performance. Now the link to the model file is updated you can re-download it. The current model should achieve an F-measure of 0.816, whose performance is comparable to the model in the paper.
@MhLiao Thank you very much for your feedback & help. I've finally achieved reported performance which is
Calculated!{"recall": 0.7891189215214252, "precision": 0.8523140925637025, "hmean": 0.8195, "AP": 0}
(And even 0.2% more then reported one with Quad)
And if I'm not mistaken, published code is for Quad only.
Do you have plans for publishing code&model for Quad_MS as well? Please correct me if the code is for Quad_MS (I mean if this is so, still needs 1% to get Quad_MS score).
Isn't MS score just running the model with multiple input image scales - just like TextBoxes code?
@SHaiHosh I think Multi-Scale mentioned in the paper by @MhLiao is a explicit multi-scale image input. Furthermore, the scores I've reproduced with ICDAR2015 matches with Quad score.
Calculated!{"recall": 0.7891189215214252, "precision": 0.8523140925637025, "hmean": 0.8195, "AP": 0}
@bado-lee @SHaiHosh The ms code is similar to TextBoxes. The scales include 384*384, 768*768, 1024*1024 and 1536*1536.
thank you @MhLiao
hi @bado-lee where did you find the conf.lua file to run the demo.py file?
@DecentMakeover Hi, I have reproduced detection only. So, I can't answer your question sorry.
@bado-lee i wanted to compare how textboxes compares to advanced east, Have you by any chance looked into advanced east?
Hi, I'm trying to reproduce the result in the paper which are, But it seems not reproducible.
Im using the pre-trained model downloaded from your link which is for ICDAR2015 (model_icdar15.caffemodel)
I've tested in 2 ways, These are my parameters, and the rest is as is except the loops added for batch inference.
And for the evaluation, I'm using the official evaluation code downloaded from ICDAR challenge
And below is the tested set link
Config 1
demo.py
deploy.prototxt
Result 1
Config 2
Same as Config 1 but, demo.py
Result 2
Both of the results does not seem to match the performance mentioned in the paper.
Please help if there are another parameters to be tuned.