gdamdam / saitan

Saitan allows you to save a webpage from the Internet to a web archiving tool like the Internet Archive’s Wayback Machine and archive.is. Saitan allows you also to download a local copy of the page, and all its components in a WARC file, it can timestamp it to prove that the file existed prior to some point in time.
GNU General Public License v3.0
12 stars 3 forks source link

This REPO is DEPRECATED!

saitan - v0.9 - if you see something, save something!

Saitan allows you to save a webpage from the Internet to a web archiving tool like the Internet Archive's Wayback Machine and archive.is. Saitan allows you also to download a local copy of the page, and all its components in a WARC file, calculate its checksum SHA256 and timestamp it to prove that the file existed prior to some point in time using the free service provided by opentimestamps.org. To open the WARC file we recommend you use webrecorderplayer.

Installation

To run this script you need python3, clone this repository and install some extra packages

git clone https://github.com/gdamdam/saitan.git
cd saitan
pip3 install -r requirements.txt

IMPORTANT: You need to have wget installed on you machine.

Usage

To save a page on the Wayback Machine you can use:

python3 saitan.py -w http://exemple.com

to save it on archive.is:

python3 saitan.py -a http://exemple.com

to save a local copy in a WARC file:

python3 saitan.py -l http://example.com

You can save the page in several place with a single command concatenating the arguments. To save a page on the wayback machine, archive.is and in a local WARC file:

python3 saitan.py -lwa http://example.com

To timestamp the WARC file using opentimestamps.org you can type:

python3 saitan.py -lo http://example.com

To see a complete list of the available arguments you can type:

python3 saitan.py -h

optional arguments:
-h, --help            show this help message and exit
--waybackmachine, -w  Saves [URL] to the waybackmachine
--archiveis, -a       Saves [URL] to archive.is
--localcopy, -l       Save [URL] locally in a WARC file.
--opentimestamp, -o   Create a timestamp of the WARC file using opentimestaps.org
--sha256, -s          Returns the checksum SHA256 of the WARC file.

Open the WARC file

To open the WARC file we recommend to use Webrecorder Player.

Verify the timestamp

The file timestamped and the timestamp file .ots must be stored together to allow future verification. To verify the timestamp you can use the form on the website https://opentimestamps.org. Remember that it takes a few hours for the timestamp to get confirmed.

To download content from the Wayback Machine

To download a website or a page from the waybackmachine you can use the Wayback Machine Downloader: https://github.com/hartator/wayback-machine-downloader/

To download from archive.is

Follow the script's output instructions.

Known Issues

License (GPLv3)

Copyright (C) 2018 saitan developers.

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.