hassanakbar4 / mailarchive-tickets

0 stars 0 forks source link

Remove spam from DNSOP archive #167

Closed hassanakbar4 closed 3 years ago

hassanakbar4 commented 7 years ago

component_MailArchive: ArchiveContents resolution_fixed type_task | by rcross@amsl.com


As reported by John Levine:

I'm doing some stats scraping from the IETF imap server, and some of the archives seem to contain chunks of garbage.

For example, I'm trying to get the last two years of messages from DNSOP, which IMAP search tells me is messages 11854 to 18202. Most of them are fine, but messages 15681 to 16051 are random spammy junk, dated various times from 2000 to 2008 when there's a decodable date. The good messages just before and after that range are from the same day so it looks like there's nothing missing, but the junk is startling.


Issue migrated from trac:2178 at 2021-09-22 16:48:38 +0500

hassanakbar4 commented 7 years ago

@hassanakbar4 changed status from new to closed

hassanakbar4 commented 7 years ago

@hassanakbar4 changed resolution from ` tofixed`