py-pdf / pypdf

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
https://pypdf.readthedocs.io/en/latest/
Other
8.31k stars 1.41k forks source link

NotImplementedError: unsupported filter /JBIG2Decode #951

Closed fredgido closed 1 year ago

fredgido commented 2 years ago

Would be great support to extract JBIG2 image file from pdfs. I intend to call as usual for image resources call .getData on a object with /JBIG2Decode decode filter and then write that to a file. Then, outside of PyPDF2 score, convert the saved file to a common format.

Here is pdfminer implementation: https://github.com/pdfminer/pdfminer.six/blob/master/pdfminer/jbig2.py

ItDoesntWorkScan.pdf

pubpub-zz commented 1 year ago

closed as duplicate of #1989