mvdbos / php-spider

A configurable and extensible PHP web spider
MIT License
1.33k stars 234 forks source link

Add prefetch filter to cache downloads with a max age #101

Open mvdbos opened 6 months ago

mvdbos commented 6 months ago

With this prefetch filter in place, skip fetching resources that are already downloaded and younger than max age. This requires that downloads are not segmented per spider id. A simple option is to set the same spider id for runs where you want to use the cache.