istresearch / scrapy-cluster

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
http://scrapy-cluster.readthedocs.io/
MIT License

Circuit breaker design patterns #85

Open madisonb opened 8 years ago

madisonb commented 8 years ago

Many of the individual components break down or crash when their required infrastructure is unavailable. They depend on Kafka, Redis, or ZooKeeper, but do not always have good mechanisms in place to handle those services going down or subsequently recovering. This ticket is to ensure that we both log "Unable to connect to XXXX" and establish better circuit breaker design patterns so that the components always stay up.

Note that some of the libraries we use may already have this in place, but it would still be nice to log the error while we are in a recovery state.
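
To make the request concrete, here is a minimal sketch of what such a circuit breaker could look like. It is not taken from the scrapy-cluster code base: `CircuitBreaker`, `CircuitOpenError`, and the `max_failures` / `reset_timeout` parameters are hypothetical names chosen for illustration, and the wrapped callable stands in for whatever Kafka, Redis, or ZooKeeper call a component makes.

```python
import logging
import time

logger = logging.getLogger("circuit_breaker_sketch")


class CircuitOpenError(Exception):
    """Raised when a call is skipped because the circuit is open."""


class CircuitBreaker:
    """Hypothetical helper, not part of scrapy-cluster or any client library."""

    def __init__(self, name, max_failures=3, reset_timeout=30.0,
                 clock=time.monotonic):
        self.name = name                    # e.g. "Redis", used in log lines
        self.max_failures = max_failures    # failures before the circuit opens
        self.reset_timeout = reset_timeout  # seconds to wait before a retry
        self.clock = clock                  # injectable so it can be tested
        self.failures = 0
        self.opened_at = None               # None means the circuit is closed

    def call(self, func, *args, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_timeout:
                # Still open: fail fast instead of hitting the dead service.
                raise CircuitOpenError("%s circuit is open" % self.name)
            # Timeout elapsed: fall through and allow one trial call.
        try:
            result = func(*args, **kwargs)
        except Exception as exc:  # a real component would catch narrower errors
            self.failures += 1
            logger.error("Unable to connect to %s: %s", self.name, exc)
            if self.opened_at is not None or self.failures >= self.max_failures:
                # Open the circuit, or re-open it after a failed trial call.
                self.opened_at = self.clock()
            raise
        if self.opened_at is not None or self.failures:
            logger.info("Connection to %s recovered", self.name)
        self.failures = 0
        self.opened_at = None
        return result
```

A component would wrap each infrastructure call, for example `breaker.call(client.ping)` with whatever client object it already holds, and treat `CircuitOpenError` as a signal to sleep and retry rather than exit. The point of the pattern is that the process stays alive and keeps logging while the dependency is down, then resumes on its own once a trial call succeeds.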