mvz / email-outlook-message-perl

Email::Outlook::Message Perl module for reading Outlook .msg files
http://www.matijs.net/software/msgconv/
175 stars 31 forks source link

Handle RTF-encapsulated HTML #6

Open kosta-beller opened 7 years ago

kosta-beller commented 7 years ago

The attached archive contains .msg files with HTML bodies encoded in base64. The library fails to convert them correctly to .eml messages.

msg.zip

mvz commented 7 years ago

@granite-zero I'm pretty sure your issue is different: The problem with @kobeller's messages is that the HTML bodies are actually RTF, and the problem does occur on Linux. Can you open a new ticket please?

genericcx commented 6 years ago

I also have the same issue on linux when converting the html (as it sees RTF - and then add's the html part as rtf attachment). Is there a way to get round this? Perhaps being able to extract the html from the rtf part?

mvz commented 4 years ago

These bodies are RTF-encapsulated HTML (who came up with that?), and there's a process for converting it back to HTML.

kosta-beller commented 4 years ago

@mvz I know this edge case is weird, but it arrived to me in as real complaint from the customer. So, should I follow the guide you linked or do you intend to implement it in the library? Also, any tips to deterministically identify those messages and distinguish them, for example, from plain RTF?

mvz commented 4 years ago

@kobeller I'm considering implementing it in the library, but I'm still investigating whether there's a suitable RTF reader library in Perl that I can use as a base.