WordPress / openverse

Openverse is a search engine for openly-licensed media. This monorepo includes all application code.
https://openverse.org
MIT License
243 stars 194 forks source link

Create an "acknowledgement of harmful content" type page #2595

Open sarayourfriend opened 1 year ago

sarayourfriend commented 1 year ago

Problem

This is somewhat related to https://github.com/WordPress/openverse/issues/2550 and could potentially be merged into it

Also related to #2593.

Openverse aggregates works from sources that include catalogue materials with harmful (racist, violent, colonial, etc) content. For example, this result from Boston Public Library includes a violent racial slur both in the title and the text in the image (CW: racial slurs, racism). This work is sourced from BPL but hosted on Flickr. The Flickr page does not include any information about the historical context of the work either, so this example would not be covered by #2593, though the image would be marked sensitive based on the sensitive text detection work.

However, none of that would necessarily make it clear why Openverse displays such content.

Description

In addition to #2550, either as a section of that page or a separate page altogether, make explicit the fact that Openverse aggregates works from sources that hold works in their collections that include harmful content. I don't think we need to use the term "sensitive" in this particular case, because we're discussing specifically works that have been used to harm specific communities. This is different from "sensitive" in a sense because it's universally true that racial slurs have been and are used to harm people, whereas "sensitivity" is more about subject tolerance of exposure to such things and also includes broader things like nudity which aren't necessarily harmful. The distinction is important.

The link to this page should be available in the pages menu and either:

  1. On every single results page
  2. On single result pages with some subset of our "sensitive terms list"
  3. On every single results page for particular providers

My preference is the first one with the third one being a runner-up if we need to exclude it from some providers for some reason. I don't think the second one is viable because it would require us to decide which of the terms in our sensitive terms list are "harmful" vs "sensitive".

This would need to be available alongside provider-specific links to similar acknowledgements/explanations as described in #2593.

AetherUnbound commented 11 months ago

@sarayourfriend do you feel this issue was addressed in the content from #3074 and can be closed?

sarayourfriend commented 11 months ago

No, the sensitive content explanation page does not cover the full extent of what this issue is about. The goal behind this issue and the page/feature it proposes is to directly acknowledge that there is harmful content, which goes beyond "sensitive" content. Harmful content includes things like historical racialised, violent descriptions and depictions of people present in GLAM institution catalogues, which we include in Openverse (it's relatively easy to find these if you search for common terms and include sensitive results).

The idea here is that we should explain why these results exist at all, even beyond the "non-censorship" perspective present in the sensitive content page, which explains why we don't necessarily remove things (while reserving the right to remove things at the project's discretion). But certain things, like items that are undoubtedly violent and harmful, as part of the historical record contained in GLAM institution catalogues, we wouldn't remove (at least I don't think we would or should). That's a separate issue from sensitive content, from my perspective.

It's also critical, as I mentioned in the issue description, to include these on results where the GLAM institution themselves have acknowledged the harmful content. From a reputational perspective, for Openverse and the provider, it's important to include these acknowledgements. Openverse's presentation of those results would not be in line with the way those providers themselves present the work, leaving out significant interpretive aids with context necessary to understand the work.