ClusterLabs / resource-agents

Combined repository of OCF agents from the RHCS and Linux-HA projects
GNU General Public License v2.0

galera: allow joiner to report non-Primary during initial IST #1910

Closed: dciabrin closed this 3 months ago

dciabrin commented 5 months ago

It seems that with recent galera versions, when a node joins a cluster there is a short time window in which the node is already connected to the primary component of the galera cluster but may still be preparing its IST. During this window, the node can report itself as 'not ready' and in 'non-primary' state.

Update the galera resource agent to allow the node to be in non-primary state, but only while it is running a "promote" operation. Any network partition that occurs during the promotion will be caught by the promote timeout.

As part of reworking the promotion code, the check for the primary partition moves into the "galera_monitor" function. The check works as before for regular "monitor" and "probe" operations.
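
The decision described above can be sketched roughly as follows. This is only an illustration of the intended behaviour, not the code from this PR: the function name `cluster_status_acceptable`, its arguments, and the status strings are hypothetical and are not taken from the resource agent.

```shell
#!/bin/sh
# Hypothetical sketch: names and arguments are illustrative assumptions,
# not the actual galera resource agent code.

# Return 0 if the cluster status reported by the node is acceptable for
# the operation being run, 1 otherwise.
#   $1: operation being run ("monitor", "probe", "promote", ...)
#   $2: cluster status reported by the node ("Primary", "non-Primary", ...)
cluster_status_acceptable() {
    op="$1"
    status="$2"

    # A node in the primary component is acceptable for every operation.
    if [ "$status" = "Primary" ]; then
        return 0
    fi

    # During "promote", tolerate a non-Primary report: the joiner may
    # still be preparing its IST. A genuine network partition is left
    # to the promote timeout.
    if [ "$op" = "promote" ]; then
        return 0
    fi

    # Regular monitor and probe operations keep the strict check.
    return 1
}
```

With this sketch, `cluster_status_acceptable promote non-Primary` succeeds, while `cluster_status_acceptable monitor non-Primary` fails, which mirrors the "only during promote" exception.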

Related-Bug: rhbz#2255414

knet-jenkins[bot] commented 5 months ago

Can one of the admins check and authorise this run please: https://ci.kronosnet.org/job/resource-agents/job/resource-agents-pipeline/job/PR-1910/1/input

oalbrigt commented 5 months ago

add to whitelist

oalbrigt commented 3 months ago

Thanks.