Open CloCkWeRX opened 2 weeks ago
I suspect that real venue-specific images could also be delivered via CDN :(
Though https://dynl.mktgcdn.com shows up an awful lot in #10630
I've yet to find a useful picture hosted at mktgcdn.com -- do any exist?
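We could create a simple ImageValidationPipeline pipeline class to raise a warning in the following conditions:
- The URL does not end in a known image extension (something.jpg, something.jpeg, something.webp, etc)
- The URL is served from a banned CDN (mktgcdn.com as an example)
- The URL contains tracking parameters matching r"utm_(source|medium|campaign|term|content)"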
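A rough sketch of what such a pipeline could look like, reading the conditions above as: no image extension, a banned CDN host, or utm_* tracking parameters. The item field name "image", the extension list, and the helper image_url_warnings are assumptions for illustration, not the project's actual code. The class follows the usual process_item(item, spider) pipeline shape but deliberately avoids importing any framework so it runs on its own.

```python
import logging
import re
from urllib.parse import urlparse

logger = logging.getLogger(__name__)

# Illustrative values only; the real lists would need to be agreed on.
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png", ".webp", ".gif")
BANNED_HOSTS = ("mktgcdn.com",)
TRACKING_RE = re.compile(r"utm_(source|medium|campaign|term|content)")


def image_url_warnings(url):
    """Return a list of problems found in an image URL (hypothetical helper)."""
    problems = []
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    # Condition 1: the path does not end in a known image extension.
    if not parsed.path.lower().endswith(IMAGE_EXTENSIONS):
        problems.append("no known image extension")
    # Condition 2: the host is a banned CDN or one of its subdomains.
    if any(host == b or host.endswith("." + b) for b in BANNED_HOSTS):
        problems.append("banned CDN host")
    # Condition 3: the query string carries utm_* tracking parameters.
    if TRACKING_RE.search(parsed.query):
        problems.append("tracking parameters")
    return problems


class ImageValidationPipeline:
    """Log a warning per problem; the item itself is passed through unchanged."""

    def process_item(self, item, spider):
        url = item.get("image")  # assumed field name
        if url:
            for problem in image_url_warnings(url):
                logger.warning("%s: %s", problem, url)
        return item
```

Because it only warns, a spider with legitimately odd image URLs keeps working; dropping or blanking the field would be a separate decision.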
Perhaps some kind of images_checked = True flag to suppress the banned-CDN warning, given the low hit rate?
Alternatively, for mktgcdn, ban all square pictures under 300 x 300 px?
I think blacklisting data: URLs might be worthwhile as well. They could be legit pictures, but they are likely content you wouldn't publish to OSM.
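Detecting data: URLs needs nothing more than a scheme check. A minimal sketch, with is_data_url as a hypothetical helper name:

```python
def is_data_url(url):
    """True if the value is an inline data: URL rather than a link to a hosted image."""
    # URL schemes are case-insensitive, so normalise before comparing.
    return url.strip().lower().startswith("data:")
```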
In #10621 there are many instances of:
https://dynl.mktgcdn.com/p/vfVdaxyP_jDZdsuJUp6zjXXz7jkJ87XeGPMuAnvDNYU/150x150.png
Blacklisting that CDN or URL pattern seems like it would be a good catch-all.
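For the URL-pattern variant, one option is to read the dimensions straight out of the file name, as in the 150x150.png example above. This is only a sketch: it assumes the WIDTHxHEIGHT file name reflects the real image size, and is_small_square_mktgcdn and min_side are made-up names.

```python
import re
from urllib.parse import urlparse

# Matches a final path segment such as "/150x150.png".
SIZE_RE = re.compile(r"/(\d+)x(\d+)\.[A-Za-z0-9]+$")


def is_small_square_mktgcdn(url, min_side=300):
    """True for mktgcdn.com URLs whose file name declares a square image under min_side px."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host != "mktgcdn.com" and not host.endswith(".mktgcdn.com"):
        return False
    match = SIZE_RE.search(parsed.path)
    if not match:
        return False
    width, height = int(match.group(1)), int(match.group(2))
    return width == height and width < min_side
```

This is narrower than banning the whole CDN, so larger or non-square images on the same host would still get through.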