spatie / crawler

An easy to use, powerful crawler implemented in PHP. Can execute Javascript.
https://freek.dev/308-building-a-crawler-in-php
MIT License
2.51k stars 357 forks source link

Crawler simply stops #442

Closed mrtnmueller closed 1 year ago

mrtnmueller commented 1 year ago

I have an issue where the crawler stops without giving any hint about why. Seems to be non-deterministic, always at another point during the crawl process.

I have some middlewares (retry, proxy, user agent header) and a redis queue but I don't think they create the problem.

The real problem seems to be that I add URLs dynamically in the observer using $crawler->addToCrawlQueue in the crawled function.

Do I need to call something else to make the crawler crawl all URLs on queue?