UAlbanyArchives / mailbagit

A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats
https://archives.albany.edu/mailbag/
MIT License
46 stars 2 forks source link

added manual utf-8 decode check for plain text pst bodies #180

Closed gwiedeman closed 2 years ago

gwiedeman commented 2 years ago

Type of Contribution

What does this implement/fix? Explain your changes.

For parsing PST files, mailbagit now manually adds a utf-8 decode check for plain text bodies, as this seems to work well. Like .html_body, it still tries PidTagInternetCodepage and PidTagMessageCodepagel as first priorities, but then tries utf-8 and finally falls back to chardetect.

Link to issue?

176

Pull Request Checklist

Please check if your PR fulfills the following requirements:

How has this been tested?

Operating System: win10 Python Version: 3.9.12

Licensing