Closed lwrubel closed 2 years ago
@justinlittman Is there remaining work to do on this or is it done (looking at the merged PRs above)?
I think it is done, unless someone wants to test prior to closing.
We believe this is done. Closing. Peter will run another couple tests and if any errors found will re-open.
Work from #486 surfaced the need to register
.arc.gz
files in addition towarc.gz
files. These are older web archives created with Archive-It in the FOIA collection. We may have.arc.gz
files in the future if further backfilling occurs.Example filename:
ARCHIVEIT-924-STANFORD-FOIA-20090126205218-00370-crawling015.us.archive.org.arc.gz
One example of code to edit: https://github.com/sul-dlss/was-registrar-app/blob/a27ed35d3deb6a6819f811080e88ccc4e8066845/app/services/web_archive_glob.rb#L6
Adjust WRA to register these files.