Open javacruft opened 3 weeks ago
FWIW; we are having a similar issue and discussion with the LXD/Microcluster team in https://github.com/canonical/microovn/pull/121
@sabaini is this the schema incompatibility thingy you mentioned yesterday ?
@masnax any pointers on making it compatible with older revisions ?
@UtkarshBhatthere yes was referring to this
I could reproduce this locally by upgrading one out of three nodes from stable to edge
Steps:
In /v/l/syslog I see these messages:
Jun 14 08:29:48 aa-0 microceph.daemon[9040]: time="2024-06-14T08:29:48Z" level=debug msg="Database error" err="schema check gracefully aborted"
Jun 14 08:29:48 aa-0 microceph.daemon[9040]: time="2024-06-14T08:29:48Z" level=warning msg="Waiting for other cluster members to upgrade their versions" address="https://240.22.0.77:7443"
Which seems to hint at a schema migration issue
Ticket CEPH-766
Bit more context can be also found here https://github.com/canonical/microcluster/issues/66. The bottom line is that this is currently an expected behavior. If there's a DB schema change, all members of the cluster must upgrade before the API becomes available again. We are in the talks (last few comments in the PR mentioned by @fnordahl) about improving the error message.
By the way, this should have been fixed by #371 which included https://github.com/canonical/microcluster/pull/150.
Issue Report
What version of MicroCeph are you using
reef/edge but mix of versions - 981 and 1026
What are the steps to reproduce this issue ?
Multi-node local deployment using https://microstack.run/docs
What happens (observed behaviour) ?
I believe one of the snaps refreshed which then caused some of the clustering daemons to fail with the following error:
What were you expecting to happen ?
For the mix of different revisions to deal with this upgrade/change to the schema in a more elegant fashion