hartator / wayback-machine-downloader

Download an entire website from the Wayback Machine.
Other
5.3k stars 700 forks source link

Can't download from ftp link? #206

Open Tosyk opened 3 years ago

Tosyk commented 3 years ago

for example this link can be opened in archive.org, each files can be downloaded: ftp://ftp.ea.com/pub/ea/ https://web.archive.org/web/*/ftp://ftp.ea.com/pub/ea/*

But when I use it the output is: `etting snapshot pages. found 0 snaphots to consider.

No files to download. Possible reasons:

LostAccount commented 2 years ago

Sadly, I found that anything that was once online using the ftp scheme is ignored. It's as though the wayback_machine_downloader only works with http schemes.

I tried ftp in the past as well. You are not alone. I believe this is just a limitation of the WBM downloader:

What I tried using the -l flag.

~$ wayback_machine_downloader -l "ftp://ftp.ea.com/pub"
Getting snapshot pages. found 0 snaphots to consider.

[
null
]
~$ wayback_machine_downloader -l ftp://ftp.ea.com/pub
Getting snapshot pages. found 0 snaphots to consider.

[
null
]
~$ wayback_machine_downloader -l ftp://ftp.ea.com
Getting snapshot pages.. found 42 snaphots to consider.

[
{"file_url":"http://www.ftp.ea.com/robots.txt","timestamp":"20130713212405","file_id":"robots.txt"}
]
~$ wayback_machine_downloader -l ftp.ea.com
Getting snapshot pages.. found 42 snaphots to consider.

[
{"file_url":"http://www.ftp.ea.com/robots.txt","timestamp":"20130713212405","file_id":"robots.txt"}
]
~$ wayback_machine_downloader -l http://ftp.ea.com
Getting snapshot pages.. found 42 snaphots to consider.

[
{"file_url":"http://www.ftp.ea.com/robots.txt","timestamp":"20130713212405","file_id":"robots.txt"}
]
Tosyk commented 2 years ago

does it worked for you to download from ftp?