pdfminer / pdfminer.six

Community maintained fork of pdfminer - we fathom PDF
https://pdfminersix.readthedocs.io
MIT License

Optimize (#911) #912

Closed mitchelljkotler closed 4 months ago

mitchelljkotler commented 1 year ago

Pull request

This closes #911 by optimizing apply_png_predictor, reducing its running time from 29 minutes to 4 seconds on a particular PDF I have.
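
For readers unfamiliar with what a PNG predictor does, here is a minimal sketch of reversing the five PNG row filters (None, Sub, Up, Average, Paeth). It is not the code from this PR or from pdfminer.six: the names undo_png_predictor and paeth, and the parameter order, are hypothetical and chosen for illustration only. The sketch accumulates output in a bytearray instead of concatenating bytes objects in a loop. Whether that is the change this PR actually made is an assumption, since the diff is not shown here.

```python
def paeth(a: int, b: int, c: int) -> int:
    """Paeth predictor from the PNG spec: a = left, b = up, c = upper-left."""
    p = a + b - c
    pa, pb, pc = abs(p - a), abs(p - b), abs(p - c)
    if pa <= pb and pa <= pc:
        return a
    if pb <= pc:
        return b
    return c


def undo_png_predictor(data: bytes, columns: int, colors: int = 1, bits: int = 8) -> bytes:
    """Reverse PNG row filtering (hypothetical helper, not the pdfminer.six API)."""
    bpp = max(1, colors * bits // 8)            # bytes per complete pixel
    row_len = (colors * bits * columns + 7) // 8
    out = bytearray()                           # mutable buffer, appended in place
    prev = bytearray(row_len)                   # the row "above" the first row is all zeros

    # Each encoded row is one filter-type byte followed by row_len data bytes.
    for start in range(0, len(data), row_len + 1):
        filter_type = data[start]
        row = bytearray(data[start + 1 : start + 1 + row_len])

        if filter_type == 0:                    # None: bytes are stored as-is
            pass
        elif filter_type == 1:                  # Sub: add the byte one pixel to the left
            for i in range(bpp, len(row)):
                row[i] = (row[i] + row[i - bpp]) & 0xFF
        elif filter_type == 2:                  # Up: add the byte directly above
            for i in range(len(row)):
                row[i] = (row[i] + prev[i]) & 0xFF
        elif filter_type == 3:                  # Average: add floor((left + up) / 2)
            for i in range(len(row)):
                left = row[i - bpp] if i >= bpp else 0
                row[i] = (row[i] + (left + prev[i]) // 2) & 0xFF
        elif filter_type == 4:                  # Paeth: add the Paeth predictor value
            for i in range(len(row)):
                left = row[i - bpp] if i >= bpp else 0
                up_left = prev[i - bpp] if i >= bpp else 0
                row[i] = (row[i] + paeth(left, prev[i], up_left)) & 0xFF
        else:
            raise ValueError(f"unsupported PNG filter type: {filter_type}")

        out += row
        prev = row
    return bytes(out)
```

Appending to a bytearray is amortized constant time per byte, whereas building the result with repeated bytes concatenation copies the whole buffer on each step, which can become quadratic on large streams.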

How Has This Been Tested?

I ran the test suite

pietermarsman commented 9 months ago

@mitchelljkotler Thanks for the update! There is still one test failing.

mitchelljkotler commented 9 months ago

I fixed the mypy check. Now the docs are failing to build, but I believe that is unrelated?

pietermarsman commented 4 months ago

Yep, I fixed that in https://github.com/pdfminer/pdfminer.six/commit/9cc4d1ddc615fddc5901ead63d11fdf3142f5499.

So merging.