So an issue that I'm finding is that there's occasionally CONNREFUSED happening from the Daemon while the workers are doing their thing. After this error is thrown, the worker is stalled (#24). So we need a way for the workers to tell the master they're still working, and if they're not (but still alive) then the Master needs to kill them and start a new one in its place.
The problem is that cluster.workers[].process.env isn't a thing; You can't restart a worker (no cluster.workers[].restart, etc. You're basically just stuck with a dead process. This almost certainly has to do with the amount of communication we're throwing at the Daemon, but we really need a way to address this.
So an issue that I'm finding is that there's occasionally CONNREFUSED happening from the Daemon while the workers are doing their thing. After this error is thrown, the worker is stalled (#24). So we need a way for the workers to tell the master they're still working, and if they're not (but still alive) then the Master needs to kill them and start a new one in its place.
The problem is that cluster.workers[].process.env isn't a thing; You can't restart a worker (no cluster.workers[].restart, etc. You're basically just stuck with a dead process. This almost certainly has to do with the amount of communication we're throwing at the Daemon, but we really need a way to address this.