BCDevOps / OpenShift4-RollOut

This is the primary board for all activities related to the roll out of OpenShift 4
Apache License 2.0
0 stars 2 forks source link

Aporeto pod count monitoring #475

Closed StevenBarre closed 3 years ago

StevenBarre commented 3 years ago

Describe the issue Create a custom script to monitor that the number of running Aporeto pods and compare to the number of nodes in the cluster and if the numbers do not match, raise an alarm.

Which Sprint Goal is this issue related to? Aporeto Monitoring

Additional context https://app.zenhub.com/workspaces/platform-experience-5bb7c5ab4b5806bc2beb9d15/issues/bcdevops/openshift4-rollout/261

Definition of done Checklist (where applicable)

StevenBarre commented 3 years ago

Started messing with my Nagios/Ansible pod again and ran into a bug. https://github.com/openshift/openshift-restclient-python/issues/389

StevenBarre commented 3 years ago

Will resume work in the new year.

StevenBarre commented 3 years ago

PR https://github.com/bcgov-c/platform-tools/pull/24

I might rewrite this from bash to ansible in the future, but it's a start!

StevenBarre commented 3 years ago

@mitovskaol this is now deployed to SILVER. We'll be adding some additional monitoring under other tickets.

mitovskaol commented 3 years ago

thank you for awesome new @sbarre-esit it will definitely help me sleep better at night :)