sul-dlss / was_robot_suite

Robots for Web Archiving Service accessioning and dissemination

Include file size in structural metadata when generating seed thumbnails #700

Closed andrewjbtw closed 6 months ago

andrewjbtw commented 6 months ago

Describe the bug: Web archive seeds accessioned since March 27, 2024 lack a file size in the structural Cocina. This breaks the embed viewer, which expects that metadata.

See:

User Impact: The embed viewer doesn't work if the seed thumbnail's file size is missing.

To Reproduce: Seed thumbnails are created by a dedicated workflow, wasSeedPreassemblyWF. The bug can probably be reproduced with an integration test, if one exists for seeds.

Expected behavior: The file size should be present in the Cocina for all deposited files, especially recently created ones.
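To illustrate the expected behavior, here is a minimal sketch of recording the size when building a thumbnail's file entry. The helper name `thumbnail_file_attributes`, the default MIME type, and the exact keys are assumptions for illustration only. This is not the robot's actual code.

```ruby
require 'tempfile'

# Hypothetical helper (not from was_robot_suite): builds the attributes for
# one file entry in the structural metadata, reading the size from disk.
def thumbnail_file_attributes(path, mime_type: 'image/jp2')
  {
    'filename' => File.basename(path),
    'hasMimeType' => mime_type,
    'size' => File.size(path) # size in bytes; the field reported missing here
  }
end

# Usage with a throwaway file standing in for a generated thumbnail.
Tempfile.create(['thumbnail', '.jp2']) do |file|
  file.write('abcde')
  file.flush
  attrs = thumbnail_file_attributes(file.path)
  puts attrs['size'] # 5 bytes were written above
end
```

Reading the size from the file on disk at the moment the structural metadata is generated keeps the value in step with what is actually deposited.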

Additional context: We changed the structural Cocina generation as part of the versioning work. I found this PR from March 26: https://github.com/sul-dlss/was_robot_suite/pull/674

I've analyzed the Cocina for all web archive seeds in SDR, and 125 are missing the file size. All 125 were created on or after March 27, 2024, so the problem started recently.
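For anyone who wants to repeat this kind of check, here is a rough sketch, not the script actually used. It assumes each object's Cocina is available as parsed JSON, with file sets under `structural.contains` and files under each file set's `structural.contains`. The method name `files_missing_size` is hypothetical.

```ruby
require 'json'

# Returns the filenames of all files in a Cocina-style hash that have no
# size recorded. The nesting is an assumption about the Cocina layout.
def files_missing_size(cocina)
  file_sets = cocina.dig('structural', 'contains') || []
  file_sets.flat_map do |file_set|
    files = file_set.dig('structural', 'contains') || []
    files.select { |file| file['size'].nil? }
         .map { |file| file['filename'] }
  end
end

# Made-up sample data: one file without a size, one with.
sample = JSON.parse(<<~COCINA)
  {
    "structural": {
      "contains": [
        {
          "structural": {
            "contains": [
              { "filename": "thumbnail.jp2" },
              { "filename": "other.jp2", "size": 1024 }
            ]
          }
        }
      ]
    }
  }
COCINA

puts files_missing_size(sample).inspect
```

Running this over every seed and counting the objects with a non-empty result would give a figure comparable to the 125 above.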