monniert / docExtractor

(ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper
https://www.tmonnier.com/docExtractor
MIT License
85 stars 10 forks source link

Training a Text-Line detector and want to create annotations with x-height+ border automatically #11

Closed seekingdeep closed 3 years ago

seekingdeep commented 3 years ago

@monniert I trained a text-line detector model, the accuracy seemed high, but when i tested, the results were very bad. i even tried training at different image sizes, but still the results were not good. My Guess is that the ground truth should not be all in the same color "cyan", you might need to choose 2 colors, example: first line "cyan", then second line "red", then 3rd line "cyan", then 4th line "red", etc..... This might help in separating close regions.

possible solution: image

Example: 19699_annotated

Groundtruth: 19699_seg

Original image: 19699

monniert commented 3 years ago

Again, why do you say results are bad in this case? Mistakes seem to only occur for the footnotes. Three things you can try to prevent text lines merging and improve results:

monniert commented 3 years ago

@seekingdeep closing the issue, please reopen if necessary

seekingdeep commented 3 years ago

you are right, this most likely is caused because i didn't use x-height+border in the annotation for training. instead i used bounding-box for entire text-line which resulted in text-line to be merging with another.