atlanhq / camelot

Camelot: PDF Table Extraction for Humans
https://camelot-py.readthedocs.io

Issue detecting short lines and merged cells of a table #414

Open parthjpandya opened 4 years ago

parthjpandya commented 4 years ago

Hi,

Following the "Detect short lines" section of the advanced usage guide (https://camelot-py.readthedocs.io/en/master/user/advanced.html#detect-short-lines), I am able to detect the table's grid lines, but in the extracted output the merged cells come out as a single cell.

I am able to detect the lines, as shown below:

[image]

But how do I get the actual output table to look like this?

[image]

The current output we are getting looks like this:

[image]

I hope my question is clear.
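One possible direction, sketched under assumptions rather than confirmed in this thread: the Lattice flavor of `camelot.read_pdf` accepts a `line_scale` argument (for short lines) and a `copy_text` argument that copies the text of a spanning cell across the cells it covers, vertically (`'v'`) or horizontally (`'h'`). Whether that matches the desired output depends on the PDF. The file name, the `line_scale` value of 40, and both helper names below are hypothetical. `fill_spanned_cells` is a plain post-processing fallback, not part of Camelot, and it cannot tell a genuinely empty cell from a spanned one.

```python
def extract_with_spanning_text(pdf_path, line_scale=40, copy_text=("v",)):
    """Hypothetical wrapper: run Lattice extraction with short-line
    detection and spanning-cell text copying enabled."""
    import camelot  # imported lazily so the helper below works without it

    return camelot.read_pdf(
        pdf_path,
        flavor="lattice",
        line_scale=line_scale,      # larger values detect shorter lines
        copy_text=list(copy_text),  # copy spanning-cell text 'v' and/or 'h'
    )


def fill_spanned_cells(rows, direction="v"):
    """Fallback (not a Camelot API): fill empty-string cells with the value
    from the cell above ('v') or to the left ('h').

    Assumes empty cells are the remainder of a merged cell, which is not
    always true for real tables.
    """
    filled = [list(row) for row in rows]  # copy so the input is not mutated
    if direction == "v":
        for r in range(1, len(filled)):
            for c in range(len(filled[r])):
                if filled[r][c] == "" and c < len(filled[r - 1]):
                    filled[r][c] = filled[r - 1][c]
    elif direction == "h":
        for row in filled:
            for c in range(1, len(row)):
                if row[c] == "":
                    row[c] = row[c - 1]
    else:
        raise ValueError("direction must be 'v' or 'h'")
    return filled
```

With Camelot installed, usage would look like `tables = extract_with_spanning_text("example.pdf")` followed by `tables[0].df` to inspect the result. The fallback can be applied to `tables[0].df.values.tolist()` if `copy_text` alone does not give the expected layout.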

potatoHHHH commented 4 months ago

I had the same problem