ArchiveTeam / NewsGrabber

Grabbing all news.
62 stars 32 forks source link

Prevent static page requisites from being regrabbed #44

Open Arkiver2 opened 8 years ago

Arkiver2 commented 8 years ago

This could be done by taken the list of downloaded page requisites from the first run on newly discovered URLs from a website and adding these downloaded page requisite URLs to a list of URLs to be not redownloaded in the future.