Victor4X / wayback-clt

A Wayback Machine command line tool
MIT License

Broken output #2

Open frederikstroem opened 4 years ago

frederikstroem commented 4 years ago

I found the following output error when trying a specific PDF URL.

Entering this command:

$ wayback https://databank.worldbank.org/data/download/GNIPC.pdf

Outputs:

http://web.archive.orghttps://databank.worldbank.org/data/download/GNIPC.pdf

Expected the following output:

https://web.archive.org/web/20200328161534/https://databank.worldbank.org/data/download/GNIPC.pdf

Victor4X commented 4 years ago

This looks like a limitation of the API endpoint I am using: https://archive.readme.io/docs/creating-a-snapshot (try putting your URL in the "Try It Out" section of that page).

This might be fixable by fetching the latest snapshot of the page from this endpoint: https://archive.readme.io/docs/website-snapshots. That assumes the POST actually creates a snapshot and just doesn't return its ID.
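
A rough sketch of that fallback, in Python for illustration only (it is not taken from this tool's code). It assumes the lookup behaves like the public Wayback availability API at `https://archive.org/wayback/available`, which returns a JSON object with an `archived_snapshots.closest` entry; whether the readme.io page linked above documents exactly this endpoint is an assumption. The function names `availability_query`, `closest_snapshot_url`, and `latest_snapshot` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint: the public Wayback availability API.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_query(target_url):
    """Build the lookup URL, percent-encoding the target URL."""
    query = urllib.parse.urlencode({"url": target_url})
    return AVAILABILITY_ENDPOINT + "?" + query


def closest_snapshot_url(payload):
    """Return the snapshot URL from a decoded response, or None if absent."""
    closest = payload.get("archived_snapshots", {}).get("closest")
    if not closest or not closest.get("available"):
        return None
    return closest.get("url")


def latest_snapshot(target_url, timeout=10):
    """Query the availability API over the network and return a snapshot URL."""
    with urllib.request.urlopen(availability_query(target_url), timeout=timeout) as response:
        return closest_snapshot_url(json.load(response))
```

Calling `latest_snapshot("https://databank.worldbank.org/data/download/GNIPC.pdf")` after the POST would then return a full snapshot URL, or `None` if the archive reports no capture, instead of concatenating the host with whatever the creation endpoint sends back. Keeping the parsing in `closest_snapshot_url` separate from the network call makes the "no snapshot" case easy to handle and test.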