Closed edsu closed 1 year ago
After getting this working (and thanks for the reviews) I'm having second thoughts about merging it.
The current code has a symmetry in the way that WasapiWarcLister and SdrWarcLister return filenames. I could update SdrWarcLister to return objects but that seems like a bridge too far, at least right now.
I'm leaning towards putting what I've learned here to use in creating a wasapi library & client, which (if it works) could eventually be used by was-registrar-app (if it makes sense).
Sounds reasonable to me. Worth bringing up as a Slack discussion?
This was helpful for getting info from Archive-It as part of an FR task this week, but I feel like it clutters up otherwise easy-to-read auditing code.
I'm gonna create another tool for getting information from WASAPI, and maybe (someday) integrate it with was-registrar-app.
Why was this change made? 🤔
To help resolve a production issue we wanted to see what WARC files were available from Archive-It for a given collection. This adds a task to support fetching metadata for WARC files that are available from the WASAPI provider for a given collection druid:
The CSV will contain filename, md5, sha1, size, crawl_time, crawl_start, store_time, and location.
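As a rough illustration of how those columns might map onto WASAPI file metadata, here is a minimal sketch. It is not the code from this PR: the constant and method names are hypothetical, and it assumes each file entry is a Hash parsed from a WASAPI JSON response with string keys, hyphenated time fields (crawl-time, crawl-start, store-time), nested checksums, and a locations array from which only the first entry is written.

```ruby
require "csv"

# Column order taken from the description above.
WASAPI_CSV_HEADERS = %w[
  filename md5 sha1 size crawl_time crawl_start store_time location
].freeze

# Hypothetical helper: turn one WASAPI file entry into a CSV row.
# Assumes string keys as they would come out of JSON.parse.
def wasapi_file_to_row(file)
  checksums = file.fetch("checksums", {})
  [
    file["filename"],
    checksums["md5"],
    checksums["sha1"],
    file["size"],
    file["crawl-time"],
    file["crawl-start"],
    file["store-time"],
    Array(file["locations"]).first # only the first location is kept
  ]
end

# Hypothetical helper: build the whole CSV as a String.
def wasapi_files_to_csv(files)
  CSV.generate do |csv|
    csv << WASAPI_CSV_HEADERS
    files.each { |file| csv << wasapi_file_to_row(file) }
  end
end
```

Missing fields simply produce empty cells, so a partial response still yields a row for every file.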
How was this change tested? 🤨
Running the rake task in development with a copy of the production database and configuration.