Manually edit CSV to maybe address formatting mistakes and take care of the pid issue
That must be checked for correctness. Recommend doing this in a spreadsheet
Regenerate all IIIF image derivatives (delete then generate)
Regenerate all keywords pages (in _keywords directory) (delete then generate)
Regenerate search index (delete then generate)
Deleting derivatives in our case is important to clear away old data and because Wax tends to skip items if it thinks derivatives are already there, even if they are incorrect
If we want to use this data (as opposed to another digital team member with better knowledge of the CSV data), then it must be tested first, though I would recommend merging #44 and #45 first. I could then rebase this to include those changes.
Note for the future:
Once we fix up formatting and encoding issues, it looks like we should be good to look at automating all of the image, page, and search derivatives. Doing this would mean a slight change in workflow for website development, but all of the images, pages, and search index would be refreshed each time an update is pushed to GitHub to ensure that the website will always have the most up to date data. Contrast this with the current workflow of us having to manually push these derivatives (which hasn't been done in a while)
Related: #43 #46
Showing differences after the following:
pid
issue_keywords
directory) (delete then generate)Deleting derivatives in our case is important to clear away old data and because Wax tends to skip items if it thinks derivatives are already there, even if they are incorrect
If we want to use this data (as opposed to another digital team member with better knowledge of the CSV data), then it must be tested first, though I would recommend merging #44 and #45 first. I could then rebase this to include those changes.
Note for the future:
Once we fix up formatting and encoding issues, it looks like we should be good to look at automating all of the image, page, and search derivatives. Doing this would mean a slight change in workflow for website development, but all of the images, pages, and search index would be refreshed each time an update is pushed to GitHub to ensure that the website will always have the most up to date data. Contrast this with the current workflow of us having to manually push these derivatives (which hasn't been done in a while)