maplelabs / opensearch-scaling-manager

0 stars 11 forks source link

FetchMetrics failure on restart of opensearch [ISSUE] #472

Open meghana-rajashekar opened 1 year ago

meghana-rajashekar commented 1 year ago

Description

When scale_down is triggered, the Opensearch service is stopped and checked for cluster status. But if the scaling_manager's Fetch Metrics is trying to access OS for pushing the data at that point, it might PANIC and cause an application failure.

Suggested solutions:

  1. Add retry mechanism for opensearch connection
  2. Do not PANIC in FetchMetrics. Just log an error and continue