tabulapdf / tabula-java

Extract tables from PDF files
MIT License
1.82k stars 425 forks source link

Input of different table areas on different pdf pages. #486

Open RyosukeSakaguchi opened 2 years ago

RyosukeSakaguchi commented 2 years ago

If a PDF has multiple pages, I want to specify a different table area for each page. In other words, I would like the --area option to be able to specify the table area per page as follows.

{
1: [[10, 20, 30, 40], ..],
2: [[80, 100, 90, 120], ..], 
4: [[..], ..], ...
}

The dictionary key is the page number. It is only written in dictionary form for clarity, and the format can be anything, as long as a similar specification can be made with the --area option.