citizenlab / test-lists

URL testing lists intended for discovering website censorship
448 stars 342 forks source link

Archive pages that get added to the test list #179

Open hellais opened 7 years ago

hellais commented 7 years ago

It would be cool if as part of the CI process we also archived pages via http://archive.is/ as they are added.

This way if the site goes down or is self-censored, we still have an archived copy of it (and we get to know what the site was about when it was still accessible).

archive.is supports something called the memento API that we can use to automate this: http://mementoweb.org/depot/native/archiveis/.

hellais commented 2 years ago

There is now a way back machine public API, here are the docs for it: https://docs.google.com/document/d/1Nsv52MvSjbLb2PCpHlat0gkzw0EvtSgpKHu4mk0MnrA/edit#