smalot / pdfparser

PdfParser, a standalone PHP library, provides various tools to extract data from a PDF file.
GNU Lesser General Public License v3.0

getDataTm() with PDF containing accents #725

Open tiffanymartin34 opened 4 months ago

tiffanymartin34 commented 4 months ago

getDataTm() returns text segments that are split at accented characters.

Can you help me, please?

k00ni commented 4 months ago

Please upload an example PDF here that demonstrates the issue.