Layout-Parser / layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis
https://layout-parser.github.io/
Apache License 2.0
4.67k stars 449 forks source link

multi column variable #139

Open wilianuhlmann opened 2 years ago

wilianuhlmann commented 2 years ago

I have some images that range from 1 to 5 columns. The problem is that any one above 1 column I can't extract the text in an orderly way as you can see in the two attached images. How can I resolve or improve this? I saw an example posted here for 2 columns but I couldn't solve it for my case: The language of the documents is Portuguese Brazil

layout_parser layout_parser_4_columns

Thanks Guys

MarouaneZ1 commented 1 year ago

I have some images that range from 1 to 5 columns. The problem is that any one above 1 column I can't extract the text in an orderly way as you can see in the two attached images. How can I resolve or improve this? I saw an example posted here for 2 columns but I couldn't solve it for my case: The language of the documents is Portuguese Brazil

layout_parser layout_parser_4_columns

Thanks Guys

brother can you plz tell me which model you used for detection those blocks of text, i have a project for extracting information from pdfs i tried layoutparser model but it threw me many error, plz can you tell which model did you use to detect those blocks of text from a pdf file