xperseguers / t3ext-extractor

TYPO3 Extension extractor
https://extensions.typo3.org/extension/extractor
GNU General Public License v2.0

UTF-8 problem with PDF and native PHP #35

Open ste101 opened 3 years ago

ste101 commented 3 years ago

I have files with German umlauts (ä, ü) in their metadata, and the native PHP function has problems with them. With pdfinfo -enc UTF-8 everything is OK.

When I upload such a file, the characters after the umlaut are cut off. In the extension configuration's mapping tool the strings are empty. If I call mb_convert_encoding($out, 'UTF-8', 'UTF-8'); in decodeStringType, the full string is shown, but with a '?' in place of the umlaut.
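
The '?' suggests the bytes are not UTF-8 to begin with: mb_convert_encoding replaces bytes that are invalid in the declared source encoding with its substitute character, which defaults to '?'. PDF text strings are normally either UTF-16BE with a leading FE FF byte-order mark or PDFDocEncoding, a single-byte encoding. Below is a minimal sketch of a decoder along those lines. decodePdfTextString is a hypothetical helper, not the extension's actual decodeStringType, and ISO-8859-1 is only an approximation of PDFDocEncoding (the two agree on ä, ö, ü and ß but differ for some other code points).

```php
<?php
/**
 * Hypothetical helper (not part of the extension): convert a raw PDF
 * text string to UTF-8.
 */
function decodePdfTextString(string $raw): string
{
    // A leading FE FF byte-order mark marks the string as UTF-16BE.
    if (strncmp($raw, "\xFE\xFF", 2) === 0) {
        return mb_convert_encoding(substr($raw, 2), 'UTF-8', 'UTF-16BE');
    }

    // Heuristic: bytes that already form valid UTF-8 are left untouched.
    if (mb_check_encoding($raw, 'UTF-8')) {
        return $raw;
    }

    // Assumption: treat everything else as ISO-8859-1, which only
    // approximates PDFDocEncoding.
    return mb_convert_encoding($raw, 'UTF-8', 'ISO-8859-1');
}

// Byte 0xE4 is "ä" in ISO-8859-1 but not a valid UTF-8 sequence here.
$title = decodePdfTextString("Pr\xE4sentation");
```

With this approach the single byte 0xE4 is mapped to "ä" instead of being dropped or replaced, which would be consistent with both the truncation and the '?' described above. Whether it matches the real data can only be confirmed against a sample file.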

xperseguers commented 3 years ago

Please attach a sample PDF file to reproduce that problem.