Closed mwylde closed 5 months ago
This PR uses the new config system to make more parts of the system (which currently have hard-coded values) configurable:
default-checkpoint-interval
pipeline.allowed-restarts
pipeline.healthy-duration
pipeline.worker-heartbeat-timeout
pipeline.worker-startup-time
pipeline.task-startup-time
This PR uses the new config system to make more parts of the system (which currently have hard-coded values) configurable:
default-checkpoint-interval
: Default interval for checkpointingpipeline.allowed-restarts
: How many restarts to allow before moving to failed (-1 for infinite)pipeline.healthy-duration
: After this amount of time, we consider the job to be healthy and reset the restarts counterpipeline.worker-heartbeat-timeout
: Number of seconds to wait for a worker heartbeat before considering it deadpipeline.worker-startup-time
: Amount of time to wait for workers to start up before considering them failedpipeline.task-startup-time
: Amount of time to wait for tasks to startup before considering it failed