As an admin, I want to regularly rsync and reindex excerpts and articles that HathiTrust has updated so that we have the most current version of HathiTrust content. #626
Should build on issue handling regular rsync and reindex of full works #428.
As noted in the decision tree, if excerpt, check whether pages match in METS-XML. If they match, reindex that work and log change. If they don't match or match cannot be determined, add a log entry and flag for admin review.
Development notes:
[ ] Pull out the logic for checking matches from the script worked on in #560 and integrate it into rsync script
[ ] Investigate possibility of enhancing the checks by checking text we got from rsync against what’s indexed in Solr
Should build on issue handling regular rsync and reindex of full works #428.
As noted in the decision tree, if excerpt, check whether pages match in METS-XML. If they match, reindex that work and log change. If they don't match or match cannot be determined, add a log entry and flag for admin review.
Development notes: