mojaloop / project

Repo to track product development issues for the Mojaloop project.
Other
23 stars 15 forks source link

Implement Automatic Retry Mechanism in Auditing Service for Elastic Search Connectivity Issues #3837

Open ei-nghon-phoo opened 6 months ago

ei-nghon-phoo commented 6 months ago

User Story:

As a system admin and hub operator, I want to implement a mechanism for retrying to start the log service with a configured timeout when the system is restarting or down so that I can reduce the need for manual restarts of the elastic service, minimizing downtime and reduce the risk of log being not captured in the system when service is down.)

Acceptance Criteria:

  1. The system should have a configurable timeout for retrying the service restart.
  2. When the service is down, the system should automatically attempt to restart it according to the configured timeout.
  3. The retry mechanism should log each attempt and the outcome for monitoring and troubleshooting purposes.

Future consideration

  1. The system should include automated alerts or notifications to notify admins of repeated restart failures beyond the configured timeout.