psu-libraries / researcher-metadata

Penn State University's faculty and research metadata repository
https://metadata.libraries.psu.edu/
MIT License

Removes backfill #900

Closed ajkiessl closed 1 year ago

ajkiessl commented 1 year ago

Do not merge until the Activity Insight import has run in production after the recent release.

closes #898

ajkiessl commented 1 year ago

@Smullz622 @anaelizabethenriquez I spent a good part of the morning trying to figure out why, after last night's Activity Insight (AI) import, there were still so many publications in RMD with an Activity Insight file but no postprint status. I finally realized the cause is duplicates in Activity Insight. Say we import a publication's metadata and it has a file and a postprint status. Then, later in the same import, we import a duplicate of that publication that has no file and no postprint status. The second import sets the postprint status back to nil (empty value). I updated the AI importer to import the postprint status only when the incoming record actually has one.
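For illustration, the duplicate problem and the guard described above can be sketched roughly as follows. This is a minimal Python sketch, not the importer's actual code: the `Publication` class, the `apply_import` function, and the status string `"In Progress"` are all hypothetical names chosen for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Publication:
    """Hypothetical stand-in for a publication record in RMD."""
    title: str
    postprint_status: Optional[str] = None


def apply_import(pub: Publication, incoming_status: Optional[str]) -> None:
    """Copy the incoming postprint status only when it is present.

    A missing status (None or an empty string) leaves the stored value
    untouched, so a later duplicate cannot wipe out an earlier value.
    """
    if incoming_status:
        pub.postprint_status = incoming_status


pub = Publication(title="Example article")
apply_import(pub, "In Progress")  # first record: has a file and a status
apply_import(pub, None)           # later duplicate: no file, no status
print(pub.postprint_status)       # the status from the first record survives
```

Without the `if incoming_status` guard, the second call would overwrite the stored status with `None`, which matches the behavior seen after the import.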

The only way to set the postprint status back to nil in Activity Insight is to delete the file. So I also added code to the AI importer that sets the postprint status in RMD to nil whenever we delete a file.
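The file-deletion rule can be sketched in the same spirit. Again, this is only an assumed shape in Python, not the importer's real code: `PublicationWithFiles`, `delete_file`, the file name, and the status string are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class PublicationWithFiles:
    """Hypothetical publication record that tracks its Activity Insight files."""
    title: str
    files: List[str] = field(default_factory=list)
    postprint_status: Optional[str] = None


def delete_file(pub: PublicationWithFiles, filename: str) -> None:
    """Remove a file and clear the postprint status along with it.

    Clearing the status here is the only path back to None (nil),
    since imports no longer overwrite a status with an empty value.
    """
    pub.files.remove(filename)
    pub.postprint_status = None


pub_with_file = PublicationWithFiles(
    title="Example article",
    files=["postprint.pdf"],
    postprint_status="In Progress",
)
delete_file(pub_with_file, "postprint.pdf")
print(pub_with_file.files, pub_with_file.postprint_status)
```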

anaelizabethenriquez commented 1 year ago

@ajkiessl Sounds good; thanks for the heads up!