stefanklut / laypa

Layout analysis to find layout elements in documents (similar to P2PaLA)
MIT License

Relevant information for ground truth generation #42

Closed by icarl-ad 2 weeks ago

icarl-ad commented 1 month ago

Hi,

we are currently rethinking our ground truth generation rules. We have some questions regarding the baselines and text regions:

Thank you in advance!

stefanklut commented 1 month ago

Hi there,

The regions and reading order are normally produced in a post-processing step. The trained region models are specifically meant for detecting known page structures (e.g. when a distinction between marginalia and main text is necessary). The trained models use pixel-wise classification. So, to answer your questions specifically:

I hope this clears up all your questions. If not, feel free to ask more.
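
For readers unfamiliar with the term, here is a rough illustration of what pixel-wise classification means in general. It is a minimal sketch in plain Python and not laypa's actual code: the class names, the `pixelwise_labels` helper, and the toy scores are all made up for the example, and it assumes the model outputs one score map per class.

```python
# Hypothetical class names; the real classes depend on how a model was trained.
CLASSES = ["background", "main_text", "marginalia"]


def pixelwise_labels(scores):
    """Return, for every pixel, the index of the highest-scoring class.

    scores[c][y][x] is the score of class c at row y, column x.
    """
    num_classes = len(scores)
    height = len(scores[0])
    width = len(scores[0][0])
    return [
        [
            max(range(num_classes), key=lambda c: scores[c][y][x])
            for x in range(width)
        ]
        for y in range(height)
    ]


# Toy 2x3 "image": one score map per class (values are invented).
scores = [
    [[0.8, 0.1, 0.2], [0.7, 0.2, 0.1]],  # background
    [[0.1, 0.8, 0.7], [0.2, 0.7, 0.2]],  # main_text
    [[0.1, 0.1, 0.1], [0.1, 0.1, 0.7]],  # marginalia
]

mask = pixelwise_labels(scores)
for row in mask:
    print([CLASSES[i] for i in row])
```

Every pixel gets its own label, so ground truth for this kind of model is effectively a label mask. Grouping pixels into regions or lines would then happen in a separate step after the prediction.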