uga-libraries / web-archive-it-api

Scripts for using the Archive-It APIs to generate reports.
Creative Commons Attribution Share Alike 4.0 International

Add data limits #3

Open amhanson9 opened 1 year ago

amhanson9 commented 1 year ago

One common use of the seed report is to look for missing metadata for the quarterly preservation download. Some seeds will never have complete metadata and always show up in the report: seeds that were only for testing, seeds from departments that don't use the preservation workflow, and seeds that were tried but never successfully crawled. Being able to filter these seeds out by crawl date or some other data limit would reduce this noise.
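
A rough sketch of what a crawl-date limit might look like, not the script's actual code. It assumes the seed data has already been downloaded into a list of dictionaries and that each seed carries an ISO-formatted date under a key called `last_crawl_date`. That key name and the `filter_seeds` function are hypothetical and would need to be matched to whatever the API really returns.

```python
from datetime import date


def filter_seeds(seeds, earliest_crawl, date_key="last_crawl_date"):
    """Keep only seeds whose most recent crawl is on or after earliest_crawl.

    seeds: list of dicts, one per seed (assumed shape, not confirmed by the API).
    earliest_crawl: datetime.date used as the cutoff.
    date_key: hypothetical name of the field holding an ISO date or timestamp.
    """
    kept = []
    for seed in seeds:
        crawl_date = seed.get(date_key)
        # Seeds that were never successfully crawled have no date, so drop them.
        if not crawl_date:
            continue
        # Use only the YYYY-MM-DD part so full timestamps also work.
        if date.fromisoformat(crawl_date[:10]) >= earliest_crawl:
            kept.append(seed)
    return kept
```

With a cutoff such as `date(2022, 1, 1)`, test seeds and never-crawled seeds would drop out of the report, while seeds from departments that are still crawled but don't use the preservation workflow would need a different limit.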

amhanson9 commented 1 year ago

This could potentially help with the collection metadata report as well, but the number of collections is so small and stable that it isn't as much of a problem there.