webrecorder / pywb

Core Python Web Archiving Toolkit for replay and recording of web archives
https://pypi.python.org/pypi/pywb
GNU General Public License v3.0
1.37k stars 214 forks source link

Add date ranges to access control lists #703

Open anjackson opened 2 years ago

anjackson commented 2 years ago

Is your feature request related to a problem? Please describe.

We would like to refine our URL blocks, by specifying a time range for the block, we we can limit access to a subset of the snapshots of URLs, rather than blocking the whole URL for all time.

Describe the solution you'd like

Support for a syntax like the embargo syntax, e.g. these two statements (which would have similar effects!):

org,httpbin)/ - {"access": "block", "url": "httpbin.org/", "before": "20201226"}
org,httpbin)/anything/something - {"access": "allow", "url": "http://httpbin.org/anything/something", "after":"202001226}

Describe alternatives you've considered

We could delete CDX records but we don't want to block the same URLs across all access contexts.

Additional context

lwrubel commented 2 years ago

We'd also find this useful for blocking a subset of captures.

VascoRatoFCCN commented 1 year ago

We at Arquivo.pt also want this functionality!

laurensorensen commented 3 months ago

+1 for this functionality!

wumpus commented 3 months ago

+1 we are getting redaction demands for old domains which subsequently changed ownership. This date functionality would do the job perfectly.