edgi-govdata-archiving / web-monitoring

Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner")
Creative Commons Attribution Share Alike 4.0 International
105 stars 17 forks source link

Add functionality to get cabinet ID of a specific URL #53

Closed janakrajchadha closed 7 years ago

janakrajchadha commented 7 years ago

Iterating through all cabinets and finding a specific archive is a cumbersome task. Adding a function for this will make it easy for developers and analysts to find the archives of a specific domain/site, when the need arises. This should be added to the file pf_edgi.py by @danielballan which already includes various functionalities to efficiently use the PF API.

janakrajchadha commented 7 years ago

This has been added. PR in wm processing #60 - https://github.com/edgi-govdata-archiving/web-monitoring-processing/pull/60

@suchthis I don't think I can add a label to this, will you please add the label and close this after reviewing?