TheRoddyWMS / BatchEuphoria

A library to access different kinds of cluster backends
MIT License
3 stars 5 forks source link

BE doesn't handle LSF being unavailable properly #132

Open askask opened 5 years ago

askask commented 5 years ago

When the LSF server is down, the commands like bjobs, bsub try to reconnect in a loop. Like in #79, we should somehow handle this instead of blocking the connection.

Example:

$ bjobs -a
LSF daemon (LIM) not responding ... still trying
LSF daemon (LIM) not responding ... still trying
LSF daemon (LIM) not responding ... still trying
[…]
LSF daemon (LIM) not responding ... still trying
LSF daemon (LIM) not responding ... still trying
LSF daemon (LIM) not responding ... still trying
LSF is down. Please wait ...
No job found