coolwanglu / pdf2htmlEX

Convert PDF to HTML without losing text or format.
http://coolwanglu.github.com/pdf2htmlEX/

HTML source code is garbled when converting Japanese hiragana and kanji. #727

Open kawazoe-kotaro opened 7 years ago

kawazoe-kotaro commented 7 years ago

Hello. I installed pdf2htmlEX on Ubuntu and used it to convert a PDF containing Japanese hiragana and kanji to HTML. The converted HTML displays correctly in a web browser, but the text in the HTML source code is garbled. I have no idea why.

Could you tell me the cause and how to fix it? My environment is as follows:

Ubuntu 16.04 LTS
pdf2htmlEX 0.14.6
poppler 0.41.0
libfontforge 20120731
Supported image format: png jpg

Thank you.

qianShang-Tj commented 7 years ago

Have you installed poppler-data?
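
One way to answer that question is a quick check for the files the poppler-data package ships. The following is only a minimal sketch: it assumes the Debian/Ubuntu install location `/usr/share/poppler`, and the function name `missing_poppler_data` is made up for illustration. Other distributions or custom builds may use a different path.

```python
from pathlib import Path

# Assumption: on Debian/Ubuntu, poppler-data installs under /usr/share/poppler.
DEFAULT_POPPLER_DATA_DIR = Path("/usr/share/poppler")

# Subdirectories normally shipped by poppler-data; used as a rough presence check.
EXPECTED_SUBDIRS = ("cMap", "cidToUnicode", "nameToUnicode", "unicodeMap")


def missing_poppler_data(base=DEFAULT_POPPLER_DATA_DIR):
    """Return the expected poppler-data subdirectories that are absent under base."""
    base = Path(base)
    return [name for name in EXPECTED_SUBDIRS if not (base / name).is_dir()]


if __name__ == "__main__":
    missing = missing_poppler_data()
    if missing:
        print("poppler-data looks missing or incomplete:", ", ".join(missing))
    else:
        print("poppler-data appears to be installed")
```

If the check reports missing directories, `sudo apt-get install poppler-data` should install the package on Ubuntu. Whether that resolves the garbled source text is not confirmed in this thread.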