Training a Text-Line detector and want to create annotations with x-height+ border automatically

monniert / docExtractor

(ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper

https://www.tmonnier.com/docExtractor

MIT License

85 stars 10 forks source link

Training a Text-Line detector and want to create annotations with x-height+ border automatically #11

Closed seekingdeep closed 3 years ago

seekingdeep commented 3 years ago

@monniert I trained a text-line detector model, the accuracy seemed high, but when i tested, the results were very bad. i even tried training at different image sizes, but still the results were not good. My Guess is that the ground truth should not be all in the same color "cyan", you might need to choose 2 colors, example: first line "cyan", then second line "red", then 3rd line "cyan", then 4th line "red", etc..... This might help in separating close regions.

possible solution:

Example: 19699_annotated

Groundtruth: 19699_seg

Original image: 19699

monniert commented 3 years ago

Again, why do you say results are bad in this case? Mistakes seem to only occur for the footnotes. Three things you can try to prevent text lines merging and improve results:

clean ground truth: it seems you have merged line labels in your GT, so you should expect predicted lines to be merged as well, you first need to clean it to prevent such behaviour
use x-height label instead of bounding box label (check here): predicted lines should be thinner and more accurate
use border label to force gaps between lines (check the paper for explanation and ablation experiments)

monniert commented 3 years ago

@seekingdeep closing the issue, please reopen if necessary

seekingdeep commented 3 years ago

you are right, this most likely is caused because i didn't use x-height+border in the annotation for training. instead i used bounding-box for entire text-line which resulted in text-line to be merging with another.