libyal / libpff

Library and tools to access the Personal Folder File (PFF) and the Offline Folder File (OFF) format
GNU Lesser General Public License v3.0
289 stars 74 forks source link

Is there a way to get the encoding for html_body? #112

Closed gwiedeman closed 2 years ago

gwiedeman commented 2 years ago

Currently the .html_body attribute for a pypff.message object returns a bytestring of the message body. However, messages may have a variety of encodings. Is there some way of accessing the encoding for each message body so it may be reliably decoded?

I understand that this is still a work in progress, and I appreciate the lack of time, so I'm not expecting a fix if this is yet to be implemented, but I wanted to make sure there wasn't an approach that I was missing. The .record_sets attribute seems to potentially have data, but these also appear to be bytestrings which seems to run into the same issue.

Thanks for all the work done so far!

joachimmetz commented 2 years ago

This encoding occurs on the MAPI level not on the PFF database level. Have a look at my notes https://github.com/libyal/libfmapi/blob/main/documentation/MAPI%20definitions.asciidoc#61-the-message-body