ePADD / epadd

ePADD is a software package developed by Stanford University's Special Collections & University Archives that supports archival processes around the appraisal, ingest, processing, discovery, and delivery of email archives.
https://www.epaddproject.org
112 stars 24 forks source link

Smileys in Wingdings font #376

Open jfarwer opened 4 years ago

jfarwer commented 4 years ago

Microsoft Outlook automatically converts :) and :-) to a smiley saved as Wingdings font.

When converting such an email to mbox a smiley will be represented as something like

<span style=3D"font-family:=Wingdings;color:#1F497D">J

where ‘J’ is the smiley in Wingdings. In ePADD there will be a ‘J’ then instead of a smiley. This is an issue other email programs face as well and it happens quite a lot in our email test sets. It causes confusion as random letters ‘J’ appear in emails and also information is lost as smileys are not there anymore. Maybe it’s worth making these smileys being displayed as :) ?

hangal commented 3 years ago

@jfarwer does it still exist with the latest version? With changes to mbox parsing, this may have gotten fixed (but not sure). Is it even worth trying to support it?

jfarwer commented 3 years ago

@Hangal It doesn't seem to have to do with the Mbox parsing. Some of our Mbox files contain smileys encoded in quoted-printable (e.g. =F0=9F=98=80 is a smiley) where they are correctly displayed in ePADD (already with the old version of the Mbox parser). However, sometimes smileys appear as a symbol in wingdings (e.g. <span style=3D"font-family:Wingdings">J<\/span>) and then only the letter (e.g. 'J') is being displayed.

Also, when I import emails directly from a GMX account, smileys are ignored, and they don’t appear in the text in any form. That might be a problem with GMX. When I import from a Gmail account, smileys are correctly displayed.

tomhigginsuom commented 10 months ago

We should this document this case in the user manual