bbottema / outlook-message-parser

A Java parser for Outlook messages (.msg files)
76 stars 35 forks source link

Issues with certain unicode characters #58

Open robiulh opened 2 years ago

robiulh commented 2 years ago

I have been trying to use the outlook-message-parser and have found certain issues when using the following unicode characters

Hyphen (U+2010) U+2013 : EN DASH U+2019 : RIGHT SINGLE QUOTATION MARK {single comma quotation mark} U+2022 : BULLET {black small circle}

they are not displayed properly, when using getBodyHTML();

Do you know if a fix is available? I have tried with the version 1.7.13. Thanks