wellcomecollection / platform

Wellcome Collection Digital Platform
https://developers.wellcomecollection.org/
MIT License
48 stars 10 forks source link

Suppress duplicate images for AIDS posters #4874

Closed jtweed closed 4 years ago

jtweed commented 4 years ago

All of the AIDS posters are duplicated in image search. This is because the same images were ingested into both Goobi and Miro. We can't just drop Miro images when there is a METS file, as they are not always in the same image. However, because of the number of AIDS posters and their popularity we need to come up with a way to fix this instance of the problem.

Fortunately, it looks like there is one. All AIDS posters that have been through the digitisation workflow have a digcode of digaids. Now that we have this as an identifier, we can use it to suppress the creation of images from Miro, where the Miro work is linked to a work with a wellcome-digcode of digaids.

Thinking forward, it is likely that I will suggest to the production team that any Miro images that are ingested through should be given a digcode of digmiro. If they agree to that, then we'll have a way to fix this across the board.

jtweed commented 4 years ago

Dupe of #4875