redpanda-data / deployment-automation

Cluster configuration best practices
https://redpanda.com
Apache License 2.0

feat: add rolling restart #231

Open gene-redpanda opened 1 day ago

gene-redpanda commented 1 day ago

Adds a rolling restart playbook, `ansible/operation-rolling-restart.yml`. It can be run against an existing cluster to restart all nodes in sequence, one at a time, without making any other changes.
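For readers who want the per-node sequence at a glance, here is a minimal Python sketch that mirrors the task names in the test run posted on this PR. It is not the playbook itself. The `rpk` subcommands, the `redpanda` systemd unit name, and the helper names `restart_steps` and `restart_node` are assumptions made for illustration, and the node ID is taken as a parameter instead of being looked up on the host.

```python
import subprocess
from typing import Callable, List, Sequence


def _run(cmd: Sequence[str]) -> None:
    # Default runner: execute the command and raise if it exits non-zero.
    subprocess.run(list(cmd), check=True)


def restart_steps(node_id: int) -> List[List[str]]:
    """Commands for one node, in the order the task names suggest.

    Assumption: the playbook wraps rpk and systemctl roughly like this;
    the real tasks may use different commands or flags.
    """
    nid = str(node_id)
    return [
        ["rpk", "cluster", "health"],                               # Check cluster health
        ["rpk", "cluster", "maintenance", "enable", nid, "--wait"],  # Enable maintenance mode
        ["rpk", "cluster", "maintenance", "status"],                # Verify maintenance mode status
        ["rpk", "cluster", "health"],                               # Health after enabling
        ["systemctl", "stop", "redpanda"],                          # Stop Redpanda service
        ["systemctl", "start", "redpanda"],                         # Start Redpanda service
        ["rpk", "cluster", "maintenance", "disable", nid],          # Disable maintenance mode
        ["rpk", "cluster", "maintenance", "status"],                # Verify maintenance mode is disabled
        ["rpk", "cluster", "health"],                               # Health after disabling
        ["rpk", "redpanda", "admin", "brokers", "list"],            # Check broker status
    ]


def restart_node(node_id: int, run: Callable[[Sequence[str]], object] = _run) -> None:
    # Run every step in order; the default runner stops at the first failure.
    for cmd in restart_steps(node_id):
        run(cmd)
```

Passing a custom `run` callable (for example `print`) gives a dry run that lists the commands without touching the cluster.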

gene-redpanda commented 1 day ago

Test result

gene@gene-MacBook-Pro deployment-automation % ansible-playbook ansible/operation-rolling-restart.yml --private-key artifacts/testkey --inventory artifacts/hosts_devex-cicd.ini

PLAY [Redpanda Rolling Restart] **

TASK [Gathering Facts] *** ok: [35.86.118.80]

TASK [Check cluster health] ** changed: [35.86.118.80]

TASK [Get node ID] *** changed: [35.86.118.80]

TASK [Enable maintenance mode] *** changed: [35.86.118.80]

TASK [Verify maintenance mode status] **** changed: [35.86.118.80]

TASK [Check cluster health after enabling maintenance mode] ** changed: [35.86.118.80]

TASK [Stop Redpanda service] ***** changed: [35.86.118.80]

TASK [Start Redpanda service] **** changed: [35.86.118.80]

TASK [Disable maintenance mode] ** changed: [35.86.118.80]

TASK [Verify maintenance mode is disabled] *** changed: [35.86.118.80]

TASK [Check cluster health after disabling maintenance mode] ***** changed: [35.86.118.80]

TASK [Check broker status] *** changed: [35.86.118.80]

PLAY [Redpanda Rolling Restart] **

TASK [Gathering Facts] *** ok: [34.217.38.164]

TASK [Check cluster health] ** changed: [34.217.38.164]

TASK [Get node ID] *** changed: [34.217.38.164]

TASK [Enable maintenance mode] *** changed: [34.217.38.164]

TASK [Verify maintenance mode status] **** changed: [34.217.38.164]

TASK [Check cluster health after enabling maintenance mode] ** changed: [34.217.38.164]

TASK [Stop Redpanda service] ***** changed: [34.217.38.164]

TASK [Start Redpanda service] **** changed: [34.217.38.164]

TASK [Disable maintenance mode] ** changed: [34.217.38.164]

TASK [Verify maintenance mode is disabled] *** changed: [34.217.38.164]

TASK [Check cluster health after disabling maintenance mode] ***** changed: [34.217.38.164]

TASK [Check broker status] *** changed: [34.217.38.164]

PLAY [Redpanda Rolling Restart] **

TASK [Gathering Facts] *** ok: [54.202.247.58]

TASK [Check cluster health] ** changed: [54.202.247.58]

TASK [Get node ID] *** changed: [54.202.247.58]

TASK [Enable maintenance mode] *** changed: [54.202.247.58]

TASK [Verify maintenance mode status] **** changed: [54.202.247.58]

TASK [Check cluster health after enabling maintenance mode] ** changed: [54.202.247.58]

TASK [Stop Redpanda service] ***** changed: [54.202.247.58]

TASK [Start Redpanda service] **** changed: [54.202.247.58]

TASK [Disable maintenance mode] ** changed: [54.202.247.58]

TASK [Verify maintenance mode is disabled] *** changed: [54.202.247.58]

TASK [Check cluster health after disabling maintenance mode] ***** changed: [54.202.247.58]

TASK [Check broker status] *** changed: [54.202.247.58]

PLAY RECAP ***
34.217.38.164 : ok=12 changed=11 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
35.86.118.80 : ok=12 changed=11 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
54.202.247.58 : ok=12 changed=11 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
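
The recap above can also be checked mechanically, for example in CI. The following is a small sketch, assuming the recap is available as plain text in the shape shown in this comment; `parse_recap` and `all_hosts_clean` are hypothetical helper names, not part of this repository or of Ansible.

```python
import re

# One recap row: "<host> : key=value key=value ..."
RECAP_LINE = re.compile(r"^(?P<host>\S+)\s*:\s*(?P<counters>(?:\w+=\d+\s*)+)$")

SAMPLE = """\
PLAY RECAP *** 34.217.38.164 : ok=12 changed=11 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
35.86.118.80 : ok=12 changed=11 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
54.202.247.58 : ok=12 changed=11 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
"""


def parse_recap(text):
    """Return {host: {counter: int}} for every recap row found in text."""
    results = {}
    for line in text.splitlines():
        line = line.strip()
        # Tolerate a header fused onto the first host row, as in the paste above.
        if line.startswith("PLAY RECAP"):
            line = line[len("PLAY RECAP"):].lstrip(" *")
        match = RECAP_LINE.match(line)
        if match:
            pairs = (item.split("=") for item in match["counters"].split())
            results[match["host"]] = {key: int(value) for key, value in pairs}
    return results


def all_hosts_clean(recap):
    """True only if at least one host was parsed and none failed or was unreachable."""
    return bool(recap) and all(
        counters.get("failed", 0) == 0 and counters.get("unreachable", 0) == 0
        for counters in recap.values()
    )
```

With the text of this run, `all_hosts_clean(parse_recap(SAMPLE))` evaluates to `True`; an empty recap is treated as a failure so that a truncated log does not pass silently.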