opensearch-project / OpenSearch

🔎 Open source distributed and RESTful search engine.
https://opensearch.org/docs/latest/opensearch/index/
Apache License 2.0
9.64k stars 1.77k forks source link

[Remote State] Rebuild RemotePersistedState from remote on node reboot #15691

Open shiv0408 opened 1 month ago

shiv0408 commented 1 month ago

Is your feature request related to a problem? Please describe

Currently we loose the lastAcceptedState and lastAcceptedManifest when a node is bounced. This cause full upload state uploads when cluster manager is elected. This can cause throttling if start uploading too many files to remote store and cause request throttling from remote store.

Describe the solution you'd like

We rebuild the RemotePersistedState object from Remote Store and verify with cluster state object from disk.

Related component

Cluster Manager

Describe alternatives you've considered

No response

Additional context

No response

andrross commented 2 weeks ago

[Catch All Triage - 1, 2, 3]