disinfoRG / ZeroScraper

Web scraper made by 0archive.
https://0archive.tw
MIT License
10 stars 2 forks source link

Update scraper takes more than 24 hours to run #52

Closed pm5 closed 4 years ago

pm5 commented 4 years ago

It happens sometimes that execute_spiders.py --update takes more than 24 hours to run. This is problematic because there would be two daily cronjobs running at the same time. Also it seems strange that merely updating snapshots would take that long, even if update scraper is not running in parallel.