wellcomecollection / platform

Wellcome Collection Digital Platform
https://developers.wellcomecollection.org/
MIT License

Fix all the HTML files in the born-digital exports #4425

Open alexwlchan opened 4 years ago

alexwlchan commented 4 years ago

When you request the contents of an HTML file from Preservica, it "helpfully" adds its own headers and footers:

[Image: Screenshot 2020-04-15 at 16 13 32]

This means every HTML file in the storage service needs to be replaced by re-ingesting new packages through Archivematica.

To do this, we'll need a way to get the unmodified HTML files out, which remains an open problem.

alexwlchan commented 4 years ago

We can get the HTML files out (they're on the V drive and/or accounted for), but we need to fix any broken packages already in the storage service.