smalot / pdfparser

PdfParser, a standalone PHP library, provides various tools to extract data from a PDF file.
GNU Lesser General Public License v3.0
2.37k stars 538 forks source link

No output while parsing such files with headers (I suppose) #661

Closed tejpsingh9 closed 7 months ago

tejpsingh9 commented 9 months ago

Description:

No output while parsing such files I tried wihout config first, then tried with config also.

PDF input

Attached 2 files

Expected output & actual output

Code

$config = new \Smalot\PdfParser\Config(); $config->setRetainImageContent(false); $config->setDecodeMemoryLimit(1000000); $parser = new \Smalot\PdfParser\Parser([], $config);

    $pdf = $parser->parseFile("d:\\wamp\\www\\lampkm\\uploads\\juds\\c1-j1_ivrcl_judgmt.pdf");

c1-j1_ivrcl_judgmt.pdf c1-j2_kss_judgmt.pdf

c1-j2_kss_judgmt.pdf c1-j1_ivrcl_judgmt.pdf

k00ni commented 9 months ago

PDFParser Version: Latest as on 2023/12/30

Just to be sure, you tested with 2.8.0-RC2?

tejpsingh9 commented 9 months ago

How to check the version

k00ni commented 9 months ago

In case you use Composer type the following command in the terminal:

composer show

It will show a list of installed packages:

symfony/polyfill-mbstring v1.28.0 Symfony polyfill for the Mbstring extension
[...]                        
                             ^---- version
k00ni commented 8 months ago

Any update @tejpsingh9?