jsvine / pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
MIT License
6.57k stars 659 forks source link

Slow read speed #563

Closed wenderWang closed 2 years ago

wenderWang commented 2 years ago

When I opened a 600-plus page PDF, it took me 10 minutes,I hope the author can optimize it

jsvine commented 2 years ago

Thanks for your interest, @Veunsia. This library's speed is largely constrained by pdfminer.six's performance, since we use that library under the hood. Not a whole lot we can do about it, unfortunately, so closing this issue for now.