agude / wayback-machine-archiver

A Python script to submit web pages to the Wayback Machine for archiving.
https://pypi.org/project/wayback-machine-archiver/
MIT License
71 stars, 11 forks

Add ability to read sitemap files locally #15

Closed: kevinmcmahon closed this 4 years ago

kevinmcmahon commented 4 years ago

The archiver can already load sitemaps over HTTP and load URL lists from files. This PR adds the ability to read a sitemap from a local file.

You can also pass a local sitemap.xml file to the archiver; the path must carry the file:// prefix:

archiver --sitemaps file://sitemap.xml
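
For readers curious how a `file://` sitemap argument could be handled, here is a minimal sketch. It is not the code from this PR: the function names (`local_path_from_sitemap_arg`, `extract_urls_from_sitemap`, `read_local_sitemap`) are hypothetical, and it assumes the prefix is simply stripped and the rest treated as a filesystem path, with the sitemap using the standard sitemaps.org namespace.

```python
import xml.etree.ElementTree as ET

# Assumption: a local sitemap is marked by this literal prefix, which is
# stripped to obtain a plain filesystem path (e.g. "sitemap.xml").
LOCAL_PREFIX = "file://"

# Standard namespace used by sitemap.xml files (sitemaps.org protocol).
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def local_path_from_sitemap_arg(arg):
    """Return the path after file://, or None if arg is not a local sitemap."""
    if arg.startswith(LOCAL_PREFIX):
        return arg[len(LOCAL_PREFIX):]
    return None


def extract_urls_from_sitemap(xml_data):
    """Collect the text of every <loc> element in a sitemap document."""
    root = ET.fromstring(xml_data)
    return [
        loc.text.strip()
        for loc in root.iter(SITEMAP_NS + "loc")
        if loc.text
    ]


def read_local_sitemap(arg):
    """Read a file:// sitemap argument and return the URLs it lists."""
    path = local_path_from_sitemap_arg(arg)
    if path is None:
        raise ValueError("expected a file:// sitemap, got: " + arg)
    # Read bytes so the XML parser can honor any encoding declaration.
    with open(path, "rb") as handle:
        return extract_urls_from_sitemap(handle.read())
```

Under these assumptions, calling `read_local_sitemap("file://sitemap.xml")` with a `sitemap.xml` in the working directory would return the URLs from its `<loc>` entries, which could then be submitted like any other URL list.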

agude commented 4 years ago

Thanks for the code @kevinmcmahon! 🥇

Merged and released as 1.7.0 with some minor changes.