Open anirvan opened 4 years ago
I don't fully understand this, given that the code's already deleting every single non-7-bit character from the raw email message. Maybe this is an issue with an invalid Unicode character in the original email message as sent by the list participant? If that were the case, I don't think it's appropriate for this script to try to correct it.
But we should find this out. @jnew-gh, can you take a look at the 4793rd email in your list, and see what's going on with that?
If you run yahoo-groups-archive-tools
with the --noisy
option, it'll say something like
message 5000: wrote email at /somewhere/email/5000.eml (4793 of 7615)
Once you know the message ID (in this example, 5000), you can look at the associated .eml file. Is there something funky going on there?
Feel free to paste the headers as a comment, redacting any private bits. Thank you!
P.S. In addition to kmail, could you try loading the mailbox using another mbox-friendly mail client, e.g. mutt, Thunderbird, Apple Mail, etc.? I'm curious if this is a kmail specific issue.
Originally posted by @jnew-gh in https://github.com/anirvan/yahoo-group-archive-tools/issues/2#issuecomment-566259511