simonw / mbox-to-sqlite

Load email from .mbox files into SQLite
Apache License 2.0
24 stars 5 forks source link

Handle HTML emails and other attachments #2

Open simonw opened 2 years ago

simonw commented 2 years ago

This has only been tested against plain text emails so far.

I think HTML emails should have the HTML stashed in a separate column.

Not sure how best to handle other attachments though, especially attachments that might be images included in the HTML.

Should also think about XSS attacks and suchlike: should this tool run HTML through https://bleach.readthedocs.io/ or should tha be left as an exercise for the user?