camelot-dev / camelot

A Python library to extract tabular data from PDFs
https://camelot-py.readthedocs.io
MIT License
2.81k stars 449 forks source link

Table with white/black background not being fully captured #386

Open davidsbatista opened 1 year ago

davidsbatista commented 1 year ago

Hi there! First time camelot user, and trying to detect the full table. The upper part with white background is correctly detected, the bottom with black background (and white text) is not detected as part of the table.

Screenshot 2023-07-06 at 16 38 19

I've tried different options for flavor and process_background but still no success. Anyone knows if it's possible to capture the full table given that the lower part is in the opposite background/text colour?

bosd commented 1 year ago

Might be related to https://github.com/camelot-dev/camelot/pull/203