Open stonyw opened 5 months ago
comparison between table_areas and table_regions (with flavor='stream') table_areas recognize tables more accurate
When using Camelot's camelot.read_pdf function with table_areas and table_regions parameters, you're specifying the exact areas or regions of the page where you expect the tables to be. This is particularly useful for PDFs where tables are not well-detected using the default settings.
-
table_regions: This parameter is used to specify regions where tables are expected. It's similar to table_areas but less precise. It's useful when you have multiple tables in a region.
Here's an example of how to use these parameters: [image: Screenshot 2024-02-01 at 02.39.30.png] [image: Screenshot 2024-02-01 at 04.13.45.png] [image: Screenshot 2024-02-01 at 02.23.55.png]
Message ID: @.***>
$ camelot stream -plot contour 13pg.pdf
Hey!
As camelot is dead, we try to build a maintained fork at pypdf_table_extraction
.
Do you want to open the PR against that branch so that we can merge your improvement?