atlanhq / camelot

Camelot: PDF Table Extraction for Humans
https://camelot-py.readthedocs.io
Other
3.61k stars 349 forks source link

Extraction contains only first 2 columns #400

Open kennethjones17 opened 4 years ago

kennethjones17 commented 4 years ago

I've given my code, My input PDF and the output I am receiving. I don't understand whats wrong. Do forgive if its really simple, but id like your help. Thanks in Advance. Screenshot (32) Screenshot (31) Screenshot (33)

anakin87 commented 4 years ago

I think that you should play with various parameters: see https://camelot-py.readthedocs.io/en/master/api.html#main-interface

If you can, please post the PDF document.

Did you try with flavor='lattice'?

kennethjones17 commented 4 years ago

Hi,

Its a clientele document, im not really allowed to share it, and yes, i did try with most of the parameters. Tabula was giving me a good structure, but I was always using camelot, and i need to figure out why.