musa-asad closed this 6 months ago
Could you add steps to the PR overview describing how these changes were tested?
Adding.
Why is the indentation different for every YAML file? For Neuron monitor it's 2, but for the daemon-sets it's 6?
This was because the indentation of the relevant spec was greater for the other daemon-sets than for neuron monitor. For instance, volumes:
volumes:
and
volumes:
I believe this resulted in the daemonsets trying to schedule onto Fargate nodes, which will never work. This breaks the add-on upgrade, as the daemonset never rolls out completely.
Description of changes: As indicated in https://github.com/aws/containers-roadmap/issues/2195, the Amazon CloudWatch Observability EKS add-on currently does not have default tolerations for the cloudwatch-agent and fluent-bit daemonsets, which means tainted nodes won't run cloudwatch-agent and fluent-bit. I simply updated the deployments and daemonsets to have default tolerations and the ability for customers to override this.
Test output:
Nodes:
Taint:
When running helm upgrade --install amazon-cloudwatch-observability helm-charts/charts/amazon-cloudwatch-observability --values helm-charts/charts/amazon-cloudwatch-observability/values.yaml --set clusterName=my-cluster --set region=us-east-1 --set 'tolerations[0].operator=Exists' --set 'tolerations[0].effect=NoExecute':
When running helm upgrade --install amazon-cloudwatch-observability helm-charts/charts/amazon-cloudwatch-observability --values helm-charts/charts/amazon-cloudwatch-observability/values.yaml --set clusterName=my-cluster --set region=us-east-1:
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
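For readers comparing the two helm invocations above, here is a rough Python sketch of how a toleration is matched against a node taint. It is not part of this chart or of Kubernetes itself. It follows the matching rules described in the Kubernetes taints and tolerations documentation as I understand them. The Taint and Toleration classes, the helper names, and the taint keys (dedicated, example.com/gpu) are hypothetical and chosen only for illustration. They are not the taints used in the test above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Taint:
    key: str
    effect: str        # "NoSchedule", "PreferNoSchedule", or "NoExecute"
    value: str = ""


@dataclass(frozen=True)
class Toleration:
    key: str = ""            # empty key matches every taint key
    operator: str = "Equal"  # Kubernetes treats an unset operator as Equal
    value: str = ""
    effect: str = ""         # empty effect matches every effect


def tolerates(tol: Toleration, taint: Taint) -> bool:
    """Return True if this single toleration covers this single taint."""
    if tol.effect and tol.effect != taint.effect:
        return False
    if tol.key and tol.key != taint.key:
        return False
    if tol.operator == "Exists":
        return True
    if tol.operator == "Equal":
        return tol.value == taint.value
    return False  # unknown operator: treat as no match


def untolerated(tolerations, taints):
    """Taints that no toleration in the list covers."""
    return [t for t in taints
            if not any(tolerates(tol, t) for tol in tolerations)]


if __name__ == "__main__":
    # Hypothetical node taints, for illustration only.
    node_taints = [
        Taint(key="dedicated", effect="NoExecute", value="monitoring"),
        Taint(key="example.com/gpu", effect="NoSchedule", value="true"),
    ]
    # Equivalent of: --set 'tolerations[0].operator=Exists'
    #                --set 'tolerations[0].effect=NoExecute'
    override = [Toleration(operator="Exists", effect="NoExecute")]
    print(untolerated(override, node_taints))
    print(untolerated([], node_taints))
```

Under these assumptions, the override from the first command covers every NoExecute taint regardless of key but leaves NoSchedule taints untolerated, while an empty toleration list covers nothing.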