christian-vigh-phpclasses / PdfToText

Extracts text from PDF files
Other
125 stars 93 forks source link

PDFOPT_CAPTURE return mathematics simbols #24

Open agusjar opened 6 years ago

agusjar commented 6 years ago

Hello. First of all you did an excelent work!!!

I´m trying to read a table but the class is omitting last 2-3 lines of each page, so I decide to read it as a Capture. Now, I can read all rows from the table but I get some characters instead numbers, I put you an example:

↑ instead 5 → instead 7 └ instead 9 ...

Can you tell me how to solve it?

Thanks

mjblacker commented 6 years ago

I assuming this will be a font issue.

Can you send me the PDF at all and I’ll take a look.

agusjar commented 6 years ago

Here you got the pdf: 16 01 2018.pdf

Actually I can "solve" the problem with str_replace because I´m only interested on numbers and dates but will be a really problem if someone got the same problem with texts.

Thank´s for your early response!!!