kalisio / kargo

A K8S based solution to build and operate Geospatial Platforms
https://kalisio.github.io/kargo/
MIT License

Add a maximum number of attempts when restarting a service #231

Closed claustres closed 7 months ago

claustres commented 3 years ago

The base restart policy is on-failure with default options, which means it never gives up restarting a service. This typically leads to a flood of alerts when, e.g., a database service fails, because all connected apps/jobs then also fail repeatedly on restart.

It might be a good idea to set a maximum number of attempts before giving up on restarting a service. Tuning the delay and window options could also help limit the number of alerts.
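
As a rough sketch of what this could look like, assuming the services are declared in a Docker Compose/Swarm stack file (the issue does not say so explicitly; the option names on-failure, delay, and window suggest it). The service name, image, and all values below are hypothetical and only illustrate the shape of the configuration:

```yaml
# Hypothetical stack file fragment; service name, image, and values are illustrative only.
version: "3.8"
services:
  my-app:                      # hypothetical service depending on a database
    image: example/my-app:latest
    deploy:
      restart_policy:
        condition: on-failure  # restart only when the container exits with an error
        max_attempts: 5        # give up after 5 restart attempts instead of retrying forever
        delay: 30s             # wait between restart attempts to reduce alert noise
        window: 120s           # how long to wait before deciding whether a restart succeeded
```

With a bounded max_attempts, a dependent service would stop restarting once a failing database exhausts its retries, rather than failing indefinitely. Suitable values would need to be chosen per service.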

cnouguier commented 9 months ago

Is this still relevant now that we have moved to K8S?