dragonflydb / dragonfly-operator

A Kubernetes operator to install and manage Dragonfly instances.
https://www.dragonflydb.io/docs/managing-dragonfly/operator/installation
Apache License 2.0
146 stars 34 forks source link

Instance unresponsive, pods not restarting #223

Open thecodeassassin opened 4 months ago

thecodeassassin commented 4 months ago

Hi,

The current way of doing healthchecks seems somewhat insufficient for auto-healing.

Our instance became unreponsive to commands such as GET, SET, MONITOR etc but the healthchecks were still succeeding (ping/pong).

My suggestion therefore is that the current healthcheck should either be user-configurable or a more thorough healtcheck could be done such as incrementing a value.

Abhra303 commented 4 months ago

Hi @thecodeassassin , I think this is the wrong repo for this issue as the healthcheck is defined here https://github.com/dragonflydb/dragonfly/blob/main/tools/docker/healthcheck.sh. Please open an issue in the dragonfly repo.

I agree with your observation that healthcheck can be improved. Feel free to open a PR!