guardian / grid

The Guardian’s image management system
https://www.theguardian.com/info/developer-blog/2015/aug/12/open-sourcing-grid-image-service
Apache License 2.0
1.44k stars 119 forks source link

[graphic-content blurring] refine the blurring based on SMOUT (Sensitive Material Out) #4208

Closed twrichards closed 7 months ago

twrichards commented 7 months ago

Now only blur images where smout in when it appears capitalised in special instructions or as an entire keyword (case in-sensitive). Crucially, no longer matching on title & description too (like the other search phrases) since this was causing many false positives e.g. when description includes the word Portsmouth.

Based on all the [horrific] images I've seen when searching for 'smout' which aren't Portsmouth football games, they consistently have SMOUT in special instructions.

github-actions[bot] commented 7 months ago

Deploy build 12161 to TEST

All deployment options - [Deploy build 12161 to TEST](https://riffraff.gutools.co.uk/deployment/deployAgain?project=media-service%3A%3Agrid%3A%3Aall&build=12161&stage=TEST&updateStrategy=MostlyHarmless&action=deploy) - [Deploy parts of build 12161 to TEST by previewing it first](https://riffraff.gutools.co.uk/preview/yaml?project=media-service%3A%3Agrid%3A%3Aall&build=12161&stage=TEST&updateStrategy=MostlyHarmless)

From guardian/actions-riff-raff.

twrichards commented 7 months ago

tested locally (but with --use-TEST i.e. over a million pics) - works nicely

prout-bot commented 7 months ago

Seen on auth, usage, image-loader, metadata-editor, leases, cropper, collections, media-api, kahuna (merged by @twrichards 8 minutes and 44 seconds ago) Please check your changes!

prout-bot commented 7 months ago

Seen on thrall (merged by @twrichards 27 minutes and 3 seconds ago) Please check your changes!