smalot / pdfparser

PdfParser, a standalone PHP library, provides various tools to extract data from a PDF file.
GNU Lesser General Public License v3.0
2.33k stars 535 forks source link

Not working properly while parsing Hindi PDF #311

Open harshsanghani opened 4 years ago

harshsanghani commented 4 years ago

I needs to parse the PDF that contains the Hindi character but it does not provide me proper result rather It gives me many symbolic output for the parse.

k00ni commented 4 years ago

Hi @harshsanghani, can you provide a PDF file which causes these problems? The file should be free to use because we may include it into our test suite.