vitabaks / postgresql_cluster

PostgreSQL High-Availability Cluster (based on "Patroni" and DCS "etcd" or "consul"). Automating with Ansible.
MIT License

patroni : Wait for port 8008 to become open on the host #560

Closed emanfeah closed 2 months ago

emanfeah commented 3 months ago

Hello, I don't know why this error is showing.

The port is open on both servers.

In vars/main.yml:

patroni_cluster_bootstrap_method: "basebackup"  # or "wal-g", "pgbackrest", "pg_probackup"

# https://patroni.readthedocs.io/en/latest/replica_bootstrap.html#building-replicas
patroni_create_replica_methods:
  - pgbackrest
#  - wal_g
#  - pg_probackup
  - basebackup
TASK [patroni : Wait for port 8008 to become open on the host] *****************************************************************
fatal: [10.xx.3]: FAILED! => {"changed": false, "elapsed": 120, "msg": "Timeout when waiting for 10.0.30.3:8008"}
fatal: [10.0.30.8]: FAILED! => {"changed": false, "elapsed": 120, "msg": "Timeout when waiting for 10.xx.8:8008"}

NO MORE HOSTS LEFT *************************************************************************************************************
PLAY RECAP *********************************************************************************************************************
10.xx.11                 : ok=125  changed=11   unreachable=0    failed=0    skipped=247  rescued=0    ignored=0   
10.xx.3                  : ok=112  changed=5    unreachable=0    failed=1    skipped=244  rescued=0    ignored=0   
10.xx.8                  : ok=112  changed=5    unreachable=0    failed=1    skipped=244  rescued=0    ignored=0   
localhost                  : ok=0    changed=0    unreachable=0    failed=0    skipped=2    rescued=0    ignored=0   
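
A firewall rule that allows port 8008 does not by itself mean a process is listening there. 8008 is the default Patroni REST API port, so the timeout above may simply mean that Patroni never started on those nodes. As a rough sketch (the `port_is_open` helper and the placeholder address are illustrative and not part of postgresql_cluster), a plain TCP connect shows whether anything is accepting connections on that port:

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    # Placeholder address; replace with the real node IPs from the inventory.
    for host in ["127.0.0.1"]:
        state = "open" if port_is_open(host, 8008) else "closed or unreachable"
        print(f"{host}:8008 is {state}")
```

If the port is reported as closed on a node, the Patroni service log on that node is usually the next place to look.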

emanfeah commented 3 months ago

Also, when I change the bootstrap method to initdb:

In vars/main.yml:

patroni_cluster_bootstrap_method: "initdb"  # or "wal-g", "pgbackrest", "pg_probackup"

# https://patroni.readthedocs.io/en/latest/replica_bootstrap.html#building-replicas
patroni_create_replica_methods:
  - pgbackrest
#  - wal_g
#  - pg_probackup
  - basebackup
TASK [patroni : Wait for port 8008 to become open on the host] *****************************************************************
ok: [10.xx.11]
FAILED - RETRYING: [10.xx.11]: Check PostgreSQL is started and accepting connections on Master (1000 retries left).

vitabaks commented 3 months ago

Please attach the Patroni log.

sudo journalctl -u patroni -n 100

emanfeah commented 3 months ago

ubuntu@patroni1:~/postgresql2$ sudo systemctl -u patroni -n 100
systemctl: invalid option -- 'u'
ubuntu@patroni1:~/postgresql2$ sudo journalctl -u patroni.service -n 100
Jan 21 18:16:14 patroni1 patroni[199036]: 2024-01-21 18:16:14,511 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:16:24 patroni1 patroni[199036]: 2024-01-21 18:16:24,511 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:16:34 patroni1 patroni[199036]: 2024-01-21 18:16:34,511 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:16:44 patroni1 patroni[199036]: 2024-01-21 18:16:44,512 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:16:54 patroni1 patroni[199036]: 2024-01-21 18:16:54,512 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:17:04 patroni1 patroni[199036]: 2024-01-21 18:17:04,512 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:17:14 patroni1 patroni[199036]: 2024-01-21 18:17:14,512 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:17:24 patroni1 patroni[199036]: 2024-01-21 18:17:24,512 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:17:34 patroni1 patroni[199036]: 2024-01-21 18:17:34,512 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:17:44 patroni1 patroni[199036]: 2024-01-21 18:17:44,512 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:17:54 patroni1 patroni[199036]: 2024-01-21 18:17:54,512 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:18:04 patroni1 patroni[199036]: 2024-01-21 18:18:04,513 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:18:14 patroni1 patroni[199036]: 2024-01-21 18:18:14,513 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:18:24 patroni1 patroni[199036]: 2024-01-21 18:18:24,513 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:18:34 patroni1 patroni[199036]: 2024-01-21 18:18:34,513 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:18:44 patroni1 patroni[199036]: 2024-01-21 18:18:44,513 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:18:54 patroni1 patroni[199036]: 2024-01-21 18:18:54,513 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:19:04 patroni1 patroni[199036]: 2024-01-21 18:19:04,513 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:19:14 patroni1 patroni[199036]: 2024-01-21 18:19:14,509 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:19:24 patroni1 patroni[199036]: 2024-01-21 18:19:24,509 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:19:34 patroni1 patroni[199036]: 2024-01-21 18:19:34,554 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:19:44 patroni1 patroni[199036]: 2024-01-21 18:19:44,514 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:19:54 patroni1 patroni[199036]: 2024-01-21 18:19:54,514 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:20:04 patroni1 patroni[199036]: 2024-01-21 18:20:04,510 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:20:14 patroni1 patroni[199036]: 2024-01-21 18:20:14,514 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:20:24 patroni1 patroni[199036]: 2024-01-21 18:20:24,514 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:20:34 patroni1 patroni[199036]: 2024-01-21 18:20:34,510 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:20:44 patroni1 patroni[199036]: 2024-01-21 18:20:44,514 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:20:54 patroni1 patroni[199036]: 2024-01-21 18:20:54,510 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:21:04 patroni1 patroni[199036]: 2024-01-21 18:21:04,511 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:21:14 patroni1 patroni[199036]: 2024-01-21 18:21:14,515 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:21:24 patroni1 patroni[199036]: 2024-01-21 18:21:24,511 INFO: no action. I am (patroni1), the leader with the lock
Jan 21 18:21:32 patroni1 systemd[1]: Stopping Runners to orchestrate a high-availability PostgreSQL - Patroni...
Jan 21 18:21:33 patroni1 systemd[1]: patroni.service: Deactivated successfully.
Jan 21 18:21:33 patroni1 systemd[1]: Stopped Runners to orchestrate a high-availability PostgreSQL - Patroni.
Jan 21 18:21:33 patroni1 systemd[1]: patroni.service: Consumed 4.860s CPU time.
Jan 21 18:21:37 patroni1 systemd[1]: Starting Runners to orchestrate a high-availability PostgreSQL - Patroni...
Jan 21 18:21:37 patroni1 sudo[206046]: postgres : PWD=/ ; USER=root ; COMMAND=/sbin/modprobe softdog

ubuntu@patroni3:~$ sudo journalctl -u patroni.service -n 100
Jan 21 18:21:53 patroni3 systemd[1]: Stopped Runners to orchestrate a high-availability PostgreSQL - Patroni.
Jan 21 18:21:53 patroni3 systemd[1]: Starting Runners to orchestrate a high-availability PostgreSQL - Patroni...
Jan 21 18:21:53 patroni3 sudo[101941]: postgres : PWD=/ ; USER=root ; COMMAND=/sbin/modprobe softdog
Jan 21 18:21:53 patroni3 sudo[101941]: pam_unix(sudo:session): session opened for user root(uid=0) by (uid=115)
Jan 21 18:21:53 patroni3 sudo[101941]: pam_unix(sudo:session): session closed for user root
Jan 21 18:21:53 patroni3 sudo[101943]: postgres : PWD=/ ; USER=root ; COMMAND=/bin/chown postgres /dev/watchdog
Jan 21 18:21:53 patroni3 sudo[101943]: pam_unix(sudo:session): session opened for user root(uid=0) by (uid=115)
Jan 21 18:21:53 patroni3 sudo[101943]: pam_unix(sudo:session): session closed for user root
Jan 21 18:21:53 patroni3 systemd[1]: Started Runners to orchestrate a high-availability PostgreSQL - Patroni.
Jan 21 18:21:53 patroni3 patroni[101945]: Traceback (most recent call last):
Jan 21 18:21:53 patroni3 patroni[101945]:   File "/usr/local/bin/patroni", line 8, in <module>
Jan 21 18:21:53 patroni3 patroni[101945]:     sys.exit(main())
Jan 21 18:21:53 patroni3 patroni[101945]:   File "/usr/local/lib/python3.10/dist-packages/patroni/__main__.py", >
Jan 21 18:21:53 patroni3 patroni[101945]:     args = process_arguments()
Jan 21 18:21:53 patroni3 patroni[101945]:   File "/usr/local/lib/python3.10/dist-packages/patroni/__main__.py", >
Jan 21 18:21:53 patroni3 patroni[101945]:     from patroni.config_generator import generate_config
Jan 21 18:21:53 patroni3 patroni[101945]:   File "/usr/local/lib/python3.10/dist-packages/patroni/config_generat>
Jan 21 18:21:53 patroni3 patroni[101945]:     from .config import Config
Jan 21 18:21:53 patroni3 patroni[101945]:   File "/usr/local/lib/python3.10/dist-packages/patroni/config.py", li>
Jan 21 18:21:53 patroni3 patroni[101945]:     from .postgresql.config import ConfigHandler
Jan 21 18:21:53 patroni3 patroni[101945]: ModuleNotFoundError: No module named 'patroni.postgresql'
Jan 21 18:21:53 patroni3 systemd[1]: patroni.service: Main process exited, code=exited, status=1/FAILURE
Jan 21 18:21:53 patroni3 systemd[1]: patroni.service: Failed with result 'exit-code'.
Jan 21 18:21:53 patroni3 systemd[1]: patroni.service: Scheduled restart job, restart counter is at 2.
Jan 21 18:21:53 patroni3 systemd[1]: Stopped Runners to orchestrate a high-availability PostgreSQL - Patroni.
Jan 21 18:21:53 patroni3 systemd[1]: Starting Runners to orchestrate a high-availability PostgreSQL - Patroni...
Jan 21 18:21:53 patroni3 sudo[101946]: postgres : PWD=/ ; USER=root ; COMMAND=/sbin/modprobe softdog
Jan 21 18:21:53 patroni3 sudo[101946]: pam_unix(sudo:session): session opened for user root(uid=0) by (uid=115)
Jan 21 18:21:53 patroni3 sudo[101946]: pam_unix(sudo:session): session closed for user root
Jan 21 18:21:53 patroni3 sudo[101948]: postgres : PWD=/ ; USER=root ; COMMAND=/bin/chown postgres /dev/watchdog
Jan 21 18:21:53 patroni3 sudo[101948]: pam_unix(sudo:session): session opened for user root(uid=0) by (uid=115)
Jan 21 18:21:53 patroni3 sudo[101948]: pam_unix(sudo:session): session closed for user root
Jan 21 18:21:53 patroni3 systemd[1]: Started Runners to orchestrate a high-availability PostgreSQL - Patroni.
Jan 21 18:21:53 patroni3 patroni[101950]: Traceback (most recent call last):
Jan 21 18:21:53 patroni3 patroni[101950]:   File "/usr/local/bin/patroni", line 8, in <module>

Jan 21 18:21:53 patroni3 patroni[101945]: ModuleNotFoundError: No module named 'patroni.postgresql'

What does that mean?
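
The traceback shows the interpreter failing to import the `patroni.postgresql` subpackage from `/usr/local/lib/python3.10/dist-packages/patroni`, which may indicate an incomplete or damaged Patroni installation on that node. A minimal sketch for checking this, assuming it is run with the same Python interpreter that `/usr/local/bin/patroni` uses (the `module_available` helper is hypothetical):

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if the import system can locate the module `name`."""
    try:
        return importlib.util.find_spec(name) is not None
    except ImportError:
        # find_spec imports parent packages of a dotted name; this is raised
        # when a parent package is missing or fails to import.
        return False


if __name__ == "__main__":
    for name in ("patroni", "patroni.postgresql"):
        status = "found" if module_available(name) else "MISSING"
        print(f"{name}: {status}")
```

If `patroni` is found but `patroni.postgresql` is reported missing, reinstalling the Patroni package on that node is a reasonable thing to try, though the thread does not confirm the root cause.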

vitabaks commented 3 months ago

@emanfeah Please attach the full ansible log.

And an archive of your postgresql_cluster directory so that I can try to reproduce the problem.

vitabaks commented 3 months ago

@emanfeah Is the problem still relevant?