Open sakridge opened 4 years ago
cc @carllin
Oof! Was the validator really far behind/starting from older snapshot? This may be happening when the validator is really far behind and there's a lot of slots it needs to chain from the tip of the cluster back to its current root.
For cases where the validator is really far behind, should we have an alternate forward repair path that just asks trusted validators for roots (instead of doing the chaining back from the tip and repairing orphans)?
Problem
I noticed repair is taking a long time sometimes on my node connected to mainnet:
Seems that add-votes-elapsed is taking up to hundreds of seconds:
Proposed Solution
Optimize