smalot / pdfparser

PdfParser, a standalone PHP library, provides various tools to extract data from a PDF file.
GNU Lesser General Public License v3.0

PNG Images with FlateDecode are not extracting #638

Open sbhshoaib opened 1 year ago

sbhshoaib commented 1 year ago

pngimage.pdf

The attached PDF contains a PNG image that is not extracted correctly: the extracted image file is corrupted. There may be an issue in the FlateDecode decoding.
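For context, one possible explanation (an assumption, not a confirmed diagnosis of the attached file): a PDF image XObject with `/FlateDecode` holds zlib-compressed pixel samples, not a complete PNG file, so writing the stream to a `.png` file as-is yields something image viewers reject. The Python sketch below illustrates this outside PdfParser. The helper `raw_rgb_to_png` is hypothetical, and it assumes 8-bit DeviceRGB samples with no `/DecodeParms` predictor; real files may differ.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def _chunk(tag: bytes, data: bytes) -> bytes:
    """Build one PNG chunk: length, type, data, CRC over type + data."""
    crc = zlib.crc32(tag + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + tag + data + struct.pack(">I", crc)


def raw_rgb_to_png(pixels: bytes, width: int, height: int) -> bytes:
    """Wrap inflated 8-bit RGB samples in a minimal PNG container.

    Hypothetical helper: assumes DeviceRGB, 8 bits per component,
    and no predictor applied to the stream.
    """
    stride = width * 3
    if len(pixels) != stride * height:
        raise ValueError("pixel data does not match width * height * 3")
    # PNG needs a filter-type byte before every scanline; 0 means "None".
    rows = b"".join(
        b"\x00" + pixels[y * stride:(y + 1) * stride] for y in range(height)
    )
    # IHDR: width, height, bit depth 8, colour type 2 (RGB), then
    # compression, filter and interlace methods, all 0.
    ihdr = struct.pack(">IIBBBBB", width, height, 8, 2, 0, 0, 0)
    return (
        PNG_SIGNATURE
        + _chunk(b"IHDR", ihdr)
        + _chunk(b"IDAT", zlib.compress(rows))
        + _chunk(b"IEND", b"")
    )


# Simulated stream body for a 2x2 image (stand-in for the PDF stream).
pixels = bytes([255, 0, 0, 0, 255, 0, 0, 0, 255, 255, 255, 255])
stream = zlib.compress(pixels)

inflated = zlib.decompress(stream)  # raw samples, not a PNG file
png = raw_rgb_to_png(inflated, 2, 2)
```

Here `inflated` does not start with the PNG signature, while `png` does. If the stream's `/DecodeParms` specify a predictor, or the colour space is indexed or CMYK, the samples need further processing before wrapping.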

sbhshoaib commented 1 year ago

@GreyWyvern Can you please check?

GreyWyvern commented 1 year ago

I'm afraid I've been focusing on the text-extraction parts of PdfParser and not the image extraction ones. Can you provide the sample code that displays the error with your example file?