smalot / pdfparser

PdfParser, a standalone PHP library, provides various tools to extract data from a PDF file.
GNU Lesser General Public License v3.0
2.31k stars 534 forks source link

Parsing of tagged pdf? #197

Open kschroeer opened 6 years ago

kschroeer commented 6 years ago

It's more a question then an issue: is it possible to parse tags from the document? I think about the marks for headings, tables etc.

smalot commented 6 years ago

Currently it is not supported. May be someone could implement this king of object parsing