Layout-Parser / layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis
https://layout-parser.github.io/
Apache License 2.0

Challenges with table data parsing. #100

Open SAIVENKATARAJU opened 2 years ago

SAIVENKATARAJU commented 2 years ago

Hi, I am trying to detect tables inside a PDF and then parse each table without changing the positions of its cells. I am able to detect the table, as shown in the picture. However, I am using the method from https://layout-parser.readthedocs.io/en/latest/example/parse_ocr/index.html#an-alternative-method-adaptive-grouping-lines-based-on-distances, and it is not working for my table.

Looking for your valuable suggestions. How can I parse all the columns and convert the table to a pandas DataFrame?

SAIVENKATARAJU commented 2 years ago

Can someone advise? Does the process at the URL above for converting a PDF table to a pandas DataFrame work for every table like the one above?
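
For reference, here is a minimal sketch of the kind of distance-based grouping being asked about. It is not the method from the linked tutorial and not part of the layout-parser API. It assumes each OCR token inside the detected table region is already available as a plain tuple `(text, x1, y1, x2, y2)`. The names `cluster_1d`, `nearest`, and `tokens_to_grid`, the thresholds `row_gap` and `col_gap`, and the sample tokens are all hypothetical. Rows are formed by clustering vertical centers and columns by clustering left edges, so it is only likely to work for tables whose columns are left-aligned and well separated.

```python
from typing import List, Tuple

# Hypothetical token format: (text, x1, y1, x2, y2) in pixel coordinates.
Token = Tuple[str, float, float, float, float]


def cluster_1d(values: List[float], max_gap: float) -> List[float]:
    """Cluster 1-D values: a gap larger than max_gap starts a new cluster.

    Returns the mean of each cluster, in ascending order.
    """
    centers: List[float] = []
    current: List[float] = []
    for v in sorted(values):
        if current and v - current[-1] > max_gap:
            centers.append(sum(current) / len(current))
            current = []
        current.append(v)
    if current:
        centers.append(sum(current) / len(current))
    return centers


def nearest(centers: List[float], v: float) -> int:
    """Index of the cluster center closest to v."""
    return min(range(len(centers)), key=lambda i: abs(centers[i] - v))


def tokens_to_grid(tokens: List[Token],
                   row_gap: float = 10.0,
                   col_gap: float = 30.0) -> List[List[str]]:
    """Place tokens on a row/column grid; cells with no token become ''."""
    # Rows: cluster the vertical centers of the token boxes.
    row_centers = cluster_1d([(t[2] + t[4]) / 2 for t in tokens], row_gap)
    # Columns: cluster the left edges (assumes left-aligned columns).
    col_centers = cluster_1d([t[1] for t in tokens], col_gap)

    cells = [[[] for _ in col_centers] for _ in row_centers]
    # Visit tokens left to right so words sharing a cell stay in reading order.
    for t in sorted(tokens, key=lambda t: t[1]):
        r = nearest(row_centers, (t[2] + t[4]) / 2)
        c = nearest(col_centers, t[1])
        cells[r][c].append(t[0])
    return [[" ".join(cell) for cell in row] for row in cells]


# Made-up sample tokens; the last row has no value in the middle column.
tokens = [
    ("Item", 10, 10, 50, 22), ("Qty", 120, 11, 150, 23), ("Price", 220, 10, 270, 22),
    ("Apple", 11, 40, 60, 52), ("3", 122, 41, 130, 53), ("1.20", 221, 40, 260, 52),
    ("Pear", 10, 70, 50, 82), ("0.80", 220, 71, 258, 83),
]
grid = tokens_to_grid(tokens)
```

If the first row holds the headers, the result can be handed to pandas with `pandas.DataFrame(grid[1:], columns=grid[0])`. Because every row is padded to the same number of columns, missing cells stay in position as empty strings instead of shifting later values to the left. Both thresholds need tuning per document, and tables with right-aligned numbers, merged cells, or widely spaced words inside one cell would likely need a different column rule.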