mschubert / clustermq

R package to send function calls as jobs on LSF, SGE, Slurm, PBS/Torque, or each via SSH
https://mschubert.github.io/clustermq/
Apache License 2.0
146 stars 27 forks source link

Jobs crashing #212

Closed rimorob closed 3 years ago

rimorob commented 3 years ago

While trying to troubleshoot the previous issue, I set clustermq.ssh.timeout (to 10, 100, 1000), and my jobs started to, and then kept crashing with the following error: Error in private$zmq$poll(sid, timeout) : 1 peer(s) lost

Removing the timeout flag and reloading the package into a new R session doesn't seem to help. Does the ssh server persist? Do I need to bring it down manually somehow?

rimorob commented 3 years ago

This was a user error. I can't delete but will close.