patchew-project / patchew

A patch email tracking and testing system
MIT License

Sanitize or prevent the crawling of mbox files by search engines #94

Closed by pageflt 5 years ago

pageflt commented 5 years ago

The mbox files associated with patch series are being crawled by search engines. As a consequence, contributors' names and e-mail addresses are being tied to the public IP addresses of the networks they sent their patches from, which is undesirable and a major privacy concern.

Can you please address this issue by either stripping/sanitizing the e-mail headers, or by preventing the indexing of mbox files by search engines?

As an example of the issue, you can perform a Google search for "qemu site:patchew.org inurl:/mbox".
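Both options requested above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not patchew's actual code: the function names `sanitize_message` and `noindex_headers` are hypothetical, and the list of headers to strip is a guess at what typically carries the sender's network details. The `X-Robots-Tag` response header is one way to ask crawlers not to index a non-HTML resource such as an mbox file.

```python
from email import message_from_string

# Assumption: these are the headers that expose the sender's network path.
# The list is illustrative and would need review against real mbox data.
SENSITIVE_HEADERS = ("Received", "X-Received", "X-Originating-IP")


def sanitize_message(raw: str) -> str:
    """Return the message text with the sensitive headers removed."""
    msg = message_from_string(raw)
    for name in SENSITIVE_HEADERS:
        # Deleting by name removes every occurrence of the header
        # and does nothing if the header is absent.
        del msg[name]
    return msg.as_string()


def noindex_headers() -> dict:
    """HTTP response headers asking crawlers not to index the resource."""
    return {"X-Robots-Tag": "noindex, nofollow"}
```

A view that serves mbox files could pass each message through `sanitize_message` and merge `noindex_headers()` into its response headers. Stripping headers changes the stored message, so a tool that relies on the original `Received` chain would need the unsanitized copy.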

bonzini commented 5 years ago

Anybody can get this information just by downloading the mailing list archives (see for example ftp://lists.gnu.org/qemu-devel/) or by subscribing to the mailing list.

The IP address you send email from is public information, and anybody who is concerned about their privacy and sends email to a public mailing list should be aware of that.