useblocks / libpdf

Extract structured data from PDFs
MIT License
8 stars 2 forks source link

Duplicate table IDs #18

Closed ubmarco closed 2 years ago

ubmarco commented 2 years ago

The PDF issue-67-example.pdf has no outline but multiple tables. libpdf produces duplicate table IDs: image

table.1 appears multiple times, I think the tool will start counting table numbers every page. It should however start counting at every chapter start, or, if there is no outline, at the beginning of the document with increasing numbers, just like paragraphs.