mysociety / alaveteli

Provide a Freedom of Information request system for your jurisdiction
https://alaveteli.org
Other
389 stars 196 forks source link

PDFs can get corrupted by email censoring #239

Open sebbacon opened 13 years ago

sebbacon commented 13 years ago

have only seen this once, watch for it again http://www.whatdotheyknow.com/request/information_on_traffic_flows_in The image in a "stream" section get corrupted: _#p!/DB]eER4cPAPm&W7;-]L!e(U=7"h^X7hYXqSI][9UZJV+>hr2:&c@S.lRr.ndm)2]b$-lU+#lg #p!/DB]eER4cPAPm&W7;-]L!e(_U=7"h^X7hYXqSI][9UZJV+>hr2:&x@x.xxx.xxx)2]b$-lU+#lg Needs a fancy PDF library (which doesn't exist yet) that can tell when it is binary or text stream within the file. See thread in email "corrupted pdf" for more details. Maybe have option in admin to turn off censoring on a particular file. Maybe just do an MX check to see if it is really an email :)

hsenag commented 11 years ago

Here's another example of it happening: https://www.whatdotheyknow.com/request/st_james_park_ownership#incoming-428034