hapostgres / pg_auto_failover

Postgres extension and service for automated failover and high-availability
Other
1.07k stars 112 forks source link

Upgrade Failure from 2.0 to 2.1-2 #1035

Closed curtis18 closed 1 month ago

curtis18 commented 1 month ago

After apt-get upgrade, it is found that the pg_autoctl is unable to start and since the service cannot be started, it is unable to ALTER EXTENSION pgautofailover UPDATE. Is there any way to resolve the issue? Is it possible to roll back the version to 2.0? Thank you.

May 28 14:56:05 dbserver01 pg_autoctl[1986242]: 14:56:05 1986242 ERROR SQL params: 'default', '0', '14'
May 28 14:56:05 dbserver01 pg_autoctl[1986242]: 14:56:05 1986242 ERROR Failed to get nodes for group 0 in formation "default" from the monitor
May 28 14:56:05 dbserver01 pg_autoctl[1986242]: 14:56:05 1986242 ERROR Failed to query monitor to see if node id 14 has been dropped already
May 28 14:56:05 dbserver01 pg_autoctl[1986222]: 14:56:05 1986222 WARN  pg_autoctl service node-active exited with exit status 0
May 28 14:56:05 dbserver01 pg_autoctl[1986222]: 14:56:05 1986222 INFO  Restarting service node-active
May 28 14:56:05 dbserver01 pg_autoctl[1986245]: 14:56:05 1986245 INFO   /usr/bin/pg_autoctl do service node-active --pgdata /u01/pgdata/15/main -v
May 28 14:56:05 dbserver01 pg_autoctl[1986245]: 14:56:05 1986245 ERROR Monitor ERROR:  loaded "pgautofailover" library version differs from installed extension version
May 28 14:56:05 dbserver01 pg_autoctl[1986245]: 14:56:05 1986245 ERROR Monitor DETAIL:  Loaded library requires 2.1, but the installed extension version is 2.0.
May 28 14:56:05 dbserver01 pg_autoctl[1986245]: 14:56:05 1986245 ERROR Monitor HINT:  Run ALTER EXTENSION pgautofailover UPDATE and try again.
May 28 14:56:05 dbserver01 pg_autoctl[1986245]: 14:56:05 1986245 ERROR SQL query: SELECT * FROM pgautofailover.get_nodes($1, $2) WHERE node_id = $3
May 28 14:56:05 dbserver01 pg_autoctl[1986245]: 14:56:05 1986245 ERROR SQL params: 'default', '0', '14'
May 28 14:56:05 dbserver01 pg_autoctl[1986245]: 14:56:05 1986245 ERROR Failed to get nodes for group 0 in formation "default" from the monitor
May 28 14:56:05 dbserver01 pg_autoctl[1986245]: 14:56:05 1986245 ERROR Failed to query monitor to see if node id 14 has been dropped already
May 28 14:56:05 dbserver01 pg_autoctl[1986222]: 14:56:05 1986222 WARN  pg_autoctl service node-active exited with exit status 0
May 28 14:56:05 dbserver01 pg_autoctl[1986222]: 14:56:05 1986222 FATAL pg_autoctl service node-active has already been restarted 5 times in the last 1 seconds, stopping >
May 28 14:56:05 dbserver01 pg_autoctl[1986225]: 14:56:05 1986225 INFO  Postgres controller service received signal SIGTERM, terminating
May 28 14:56:05 dbserver01 pg_autoctl[1986222]: 14:56:05 1986222 FATAL Something went wrong in sub-process supervision, stopping now. See above for details.
May 28 14:56:05 dbserver01 pg_autoctl[1986222]: 14:56:05 1986222 INFO  Stop pg_autoctl
May 28 14:56:05 dbserver01 pg_autoctl[1986222]: 14:56:05 1986222 FATAL Failed to start pg_autoctl keeper service, see above for details