opensearch-project / index-management

🗃 Automate periodic data operations, such as deleting indices at a certain age or performing a rollover at a certain size
https://opensearch.org/docs/latest/im-plugin/index/
Apache License 2.0

Make transform self-healing in case of temporary exception #1052

Open RubieV opened 10 months ago

RubieV commented 10 months ago

When a cluster is busy, a query can time out. This issue typically resolves itself after a retry.

Currently, a single failure of this kind blocks the whole transform from continuing.

Adding retry logic to the code block linked below would solve this case.

https://github.com/opensearch-project/index-management/blob/417d0d9c3ac630b720081f3ea383dea26f4a6456/src/main/kotlin/org/opensearch/indexmanagement/transform/TransformSearchService.kt#L154C21-L154C45
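A minimal sketch of the kind of retry wrapper meant here, assuming exponential backoff and a fixed attempt limit. The names `retryWithBackoff` and `TransientSearchException` are hypothetical and are not taken from the plugin. The search call in `TransformSearchService.kt` would have to be adapted to run inside `block`, and a suspending variant may fit the plugin's code better than the blocking `Thread.sleep` used here.

```kotlin
// Hypothetical marker for failures that are worth retrying (e.g. a search timeout).
// This class is an assumption for illustration, not part of index-management.
class TransientSearchException(message: String) : RuntimeException(message)

/**
 * Runs [block] up to [maxAttempts] times, waiting between attempts with
 * exponential backoff. Only exceptions accepted by [isRetryable] are retried;
 * anything else, and the failure of the final attempt, propagates to the caller.
 */
fun <T> retryWithBackoff(
    maxAttempts: Int = 3,
    initialDelayMillis: Long = 100,
    factor: Double = 2.0,
    isRetryable: (Exception) -> Boolean = { it is TransientSearchException },
    sleep: (Long) -> Unit = { Thread.sleep(it) },
    block: (attempt: Int) -> T
): T {
    require(maxAttempts >= 1) { "maxAttempts must be at least 1" }
    var delayMillis = initialDelayMillis
    // All attempts except the last one are allowed to fail and retry.
    for (attempt in 1 until maxAttempts) {
        try {
            return block(attempt)
        } catch (e: Exception) {
            // Non-retryable errors should still fail the transform immediately.
            if (!isRetryable(e)) throw e
        }
        sleep(delayMillis)
        delayMillis = (delayMillis * factor).toLong()
    }
    // Last attempt: any exception is rethrown unchanged.
    return block(maxAttempts)
}
```

With the defaults, a query that times out twice and then succeeds would complete on the third attempt instead of failing the transform. The `sleep` parameter is only there so the wait can be replaced in tests.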

dblock commented 3 months ago

Catch All Triage - 1 2 3 4 5

Thanks for opening this. Would you like to open a PR with the fix you suggested?