codelibs / elasticsearch-river-web

Web Crawler for Elasticsearch
Apache License 2.0
234 stars 57 forks source link

How to enable retry on crawler? #125

Open lmatt-bit opened 7 years ago

lmatt-bit commented 7 years ago

Some pages may be failed during the crawling, is there any way to enable retry for the crawler? so that i can make sure all pages are downloaded. Meanwhile, is there any way to figure out which pages are not downloaded? There seems no such information in log or es. Thanks a lot.