jsvine / waybackpack

Download the entire Wayback Machine archive for a given URL.
MIT License
2.8k stars, 189 forks

Is there a way to limit the downloads to just zip files? #34

BGMcoder closed this issue 4 years ago

BGMcoder commented 4 years ago

There is a site I want to download, but I don't want any of the site's pages, just the zip files on it. Is there a way to limit the results?

It seems like the script only downloads the index.html pages. I'm looking for site///*.zip

jsvine commented 4 years ago

No, but I believe you should be able to get what you want by querying the Wayback Machine's CDX endpoint directly: https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server#url-match-scope
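
For anyone landing here later, a rough sketch of what that CDX query might look like in Python. This is not part of waybackpack. The function names are made up for illustration, and the parameter choices (`matchType=prefix`, a `filter` regex on the `original` field, `collapse=urlkey`, `output=json`) and the `id_` raw-content URL form reflect my reading of the CDX server docs linked above, so check them against that README before relying on them.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Public CDX search endpoint of the Wayback Machine.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(site, extension="zip"):
    """Build a CDX query URL for captures under `site` ending in `.extension`.

    Hypothetical helper: parameter names follow the CDX server README.
    """
    params = [
        ("url", site),
        ("matchType", "prefix"),  # match everything under the given URL prefix
        ("filter", rf"original:.*\.{extension}$"),  # regex on the original URL
        ("filter", "statuscode:200"),  # skip redirects and errors
        ("collapse", "urlkey"),  # one capture per distinct URL
        ("fl", "timestamp,original"),  # only the fields needed below
        ("output", "json"),
    ]
    return CDX_ENDPOINT + "?" + urlencode(params)


def rows_to_download_urls(rows):
    """Turn CDX JSON rows into Wayback download URLs.

    With output=json the first row is a header, so it is skipped.
    The `id_` suffix (an assumption here) requests the unmodified file.
    """
    return [
        f"https://web.archive.org/web/{timestamp}id_/{original}"
        for timestamp, original in rows[1:]
    ]


def list_zip_captures(site):
    """Query the CDX endpoint (network call) and return download URLs."""
    with urlopen(build_cdx_query(site)) as response:
        body = response.read().decode("utf-8")
    rows = json.loads(body) if body.strip() else []
    return rows_to_download_urls(rows)
```

Calling `list_zip_captures("example.com/")` (with a placeholder domain) should return a list of URLs that can then be fetched with any downloader. Note that the regex only matches URLs that end in `.zip`, so links with a query string after the filename would be missed.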