scientist-softserv / adventist_knapsack

Apache License 2.0
1 stars 0 forks source link

Importers should update files #234

Open KatharineV opened 8 months ago

KatharineV commented 8 months ago

This ticket is based on my understanding of how a feature should work and what is or is not happening in our instance of Hyku when we try to use the feature. My understanding could be wrong.

It appears to me that Bulkrax does not refresh files when an importer is set to run with update_files: true.

Sample importer with update_files: true

That importer ran on October 26, 2023. According to my understanding of the "update_files" feature, this importer should have removed existing files from matching works and newly attached the relevant files, based on its reading of the OAI feed. However, the works listed in the importer still have old files in the Items list.

Example: This Generic Work was originally imported on 2022-01-28, and the files in the Items list are still from that date.

I would expect to see new/updated files with the 2023-10-26 attachment date.

I would also expect that if I edit an existing importer and select the "reharvest" option, attached files would be removed and replaced with whatever the importer brought in. This feature is also not working. From my perspective, this appears to be related.

Existing importer on ADL staging that originally ran on 2023-07-25. Screenshots that follow show the original importer, my edit, selection of "reharvest" option, updated importer, and a sample work that still only contains the original files uploaded in July.

Image

Image

Image

Image

Image

https://adl.s2.adventistdigitallibrary.org/concern/images/20000164_early_panoramic_view_of_southwestern?locale=en