Closed isaacs closed 10 years ago
We are seeing this too while trying to replicate the npm registry.
Why is this a fatal error? https://github.com/iriscouch/follow/blob/master/lib/feed.js#L354-L355
if(! self.pending.wait_timer)
return self.die(new Error('Cannot find wait timer'))
clearTimeout(self.pending.wait_timer)
self.pending.wait_timer = null
Since you are clearing the timeout directly after checking if it's there, shouldn't this be:
if(! self.pending.wait_timer)
clearTimeout(self.pending.wait_timer)
self.pending.wait_timer = null
Doesn't make a lot of sense to me to die if it's not there just before you are going to clear it anyway.
@davglass im guessing the assumption is that there is something wrong if that timer has already been cleared or does not exist.
Regardless this module will be refactored as a wrapper around my changes-strem
module once i get some better test coverage. You can checkout the current wip in the refactor
branch. This will solve some of the inconsistencies.
We still see this pretty often in our production followers. It doesn't happen often enough to throw the worker into a tailspin, and we use seq-file to restart right where we left off. But still, kinda annoying.
ok so the root of this problem is actually due to a new request being created while the feed is paused because the wait_timer
expires and triggers an on_timeout()
-> retry()
in terms of function calls. This causes the resume to cause this particular failure in got_activity()
.
@davglass removing that line does seem reasonable as a stop gap so I will do some testing and publish a new version.
this is fixed in v0.11.1
. Removing that actually worked.
:thumbsup:
Occasionally this happens:
??