Open i25959341 opened 4 years ago
Could you explain it?
I currently use something like https://github.com/jpillora/backoff to do backoff with gocolly. I was wondering if we could incorporate this into the library, as it would be quite valuable to our users.
Doing retries is somewhat awkward in Colly, to be honest. Unless I'm missing something simple, you have to save some retry state in the context, retrieve and update it in `OnRequest` and `OnError`, sleeping if necessary. Everything manually.
I think most crawlers should do retries and backoff, so it would make sense if it was built in.
Good points, could you work on this?
Perhaps. I can't give any ETA yet, though.
A problem with retry is that it uses the same proxy. Ideally, I would want to use another proxy if I have to retry.
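One possible way to get a different proxy per retry is to pick the proxy from the attempt number. The following is only a sketch under that assumption: `proxyForAttempt` is a hypothetical helper, and the proxy addresses in the usage note are placeholders.

```go
package main

import (
	"errors"
	"net/url"
)

// proxyForAttempt is a hypothetical helper (not part of Colly). It maps
// an attempt number to a proxy URL by cycling through the list, so that
// consecutive attempts use different proxies whenever more than one
// proxy is configured.
func proxyForAttempt(proxies []string, attempt int) (*url.URL, error) {
	if len(proxies) == 0 {
		return nil, errors.New("no proxies configured")
	}
	if attempt < 0 {
		attempt = 0
	}
	return url.Parse(proxies[attempt%len(proxies)])
}
```

For example, with the list `http://proxy-a.example:8080`, `http://proxy-b.example:8080`, attempt 0 would go through proxy A and attempt 1 through proxy B. Colly's `SetProxyFunc` takes a function of the form `func(*http.Request) (*url.URL, error)`, so the attempt number would still need to reach that function somehow (for instance via state stored alongside the request); how best to do that is left open here.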
If you have to implement it, what would the solution look like?