nisaacson / pdf-extract

Node PDF Extract
MIT License
383 stars, 76 forks

Way to remove header and footer in the generated text #41

Open svang-app opened 3 years ago

svang-app commented 3 years ago

Hi,

The tool works great for extracting data, but on some pages the footer text gets intermingled with the body text of the page, which breaks the parsing.

Is there a way to turn off the header and footer extraction somewhere in the code?
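
To illustrate what I am after, here is a rough sketch of the post-processing I would otherwise have to do myself. It assumes the extracted text is available as an array of strings, one per page, and that the header or footer ends up on its own first or last line. It would not help where the footer is merged into the middle of a body line. `stripRepeatedEdges` and `normalize` are hypothetical helpers of my own, not part of pdf-extract.

```javascript
// Hypothetical helpers, not part of pdf-extract.

// Collapse digits so "Page 3 of 10" and "Page 4 of 10" compare as equal.
function normalize(line) {
  return line.trim().replace(/\d+/g, '#');
}

// pages: array of strings, one per page (an assumption about the input shape).
// Drops the first and last non-empty line of a page when the same line
// (ignoring digits) appears in that position on at least `minShare` of pages.
// Note: blank lines are dropped from the output as a side effect.
function stripRepeatedEdges(pages, minShare = 0.6) {
  const split = pages.map((p) => p.split('\n').filter((l) => l.trim() !== ''));

  // Count how often each normalized edge line occurs across pages.
  const countEdges = (pick) => {
    const counts = new Map();
    for (const lines of split) {
      if (lines.length === 0) continue;
      const key = normalize(pick(lines));
      counts.set(key, (counts.get(key) || 0) + 1);
    }
    return counts;
  };

  const firstCounts = countEdges((lines) => lines[0]);
  const lastCounts = countEdges((lines) => lines[lines.length - 1]);

  // Require at least two occurrences so a single page is never stripped.
  const threshold = Math.max(2, Math.ceil(pages.length * minShare));

  return split.map((lines) => {
    let out = lines.slice();
    if (out.length && firstCounts.get(normalize(out[0])) >= threshold) {
      out = out.slice(1); // repeated header line
    }
    if (out.length && lastCounts.get(normalize(out[out.length - 1])) >= threshold) {
      out = out.slice(0, -1); // repeated footer line
    }
    return out.join('\n');
  });
}
```

A built-in option to skip headers and footers during extraction would be much more reliable than this kind of heuristic.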

Thank you