lebedov / python-pdfbox

Python interface to Apache PDFBox command-line tools.
Other
75 stars 24 forks source link

start_page and end_page not working #16

Open luke4u opened 4 years ago

luke4u commented 4 years ago

Hi Guys,

Trying to set the start page and end page for extraction texts. But produced texts for all pages. Could anyone explain why?

p = pdfbox.PDFBox()
p.extract_text(input_path = file_path, output_path = '', password = None, encoding = None, html = False, sort=False,
               ignore_beads=False, startpage = 4, end_page=6)

Thanks a lot. Luke