Open konstan opened 8 years ago
This can be a problem in case of long-running auto-scalable runs.
For mutable(scalable) run there is no limit on the number of the retry the node executor will made so it will not be a problem.
return another status code (e.g. 503 with Retry-After header) for ss:state RTP resource, so that node executor can act more wisely in case of Ready state.
Currently the Server doesn't tell to the client how many time to wait but instead the client implement an "exponential backoff" algorithm. Retry-After
doesn't make sense in the case of a server issue (like a 502
) because if the server is crashed it cannot determine how many time the client has to wait before retry and nginx cannot either because it doesn't know when the server will come back.
To summarize:
503 Service Unavailable
(maintenance mode). So this is why it's important to set this mode when we do a maintenance.
Here is the run aborted in Ready state due to
502
.This can be a problem in case of long-running auto-scalable runs.
Solutions might be:
5xx
) is detected in Ready state, when the service comes back don't abort the run as the first thing.ss:state
(when the value isReady
) in the caching proxy when the upstream is not available (at the moment it's 10 sec (?)).503
withRetry-After
header) forss:state
RTP resource, so that node executor can act more wisely in case of Ready state.