jsvine / pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
MIT License
6.1k stars 625 forks source link

Is there any method to remove the header and footer of a pdf? #843

Closed 154192 closed 1 year ago

154192 commented 1 year ago

Please describe, in as much detail as possible, your proposal and how it would improve your experience with pdfplumber.

jsvine commented 1 year ago

The PDF specification does not have concept of a header or footer; anything that looks like a header or footer is implemented by the particular software that is writing the PDF. For that reason, there is no generic solution for removing headers/footers (although there may be a specific solution for whatever specific PDFs you're working with).