kubevirt / kubevirtci

Contains cluster definitions and client tools to quickly spin up and destroy ephemeral and scalable k8s and ocp clusters for testing
Apache License 2.0

monitoring: Update kubevirtci prometheus stack to more recent versions #1154

Open brianmcarey opened 4 months ago

brianmcarey commented 4 months ago

The monitoring stack that is deployed as part of kubevirtci has not been updated in some time.

This stack includes a number of services[1] and is based on kube-prometheus.

The manifests for each of these services will have to be updated.

Follow the KUBEVIRTCI_LOCAL_TESTING.md document to test the updates locally. https://github.com/kubevirt/kubevirtci/blob/main/KUBEVIRTCI_LOCAL_TESTING.md#start-cluster-with-prometheus-alertmanager-and-grafana

As part of this task, it would be good to write down the steps taken during the upgrade, as the existing upgrade document lacks detail: https://github.com/kubevirt/kubevirtci/tree/main/cluster-provision/k8s/1.29/manifests/prometheus#upgrading

[1] https://github.com/kubevirt/kubevirtci/tree/main/cluster-provision/k8s/1.29/manifests/prometheus

brianmcarey commented 4 months ago

/good-first-issue

kubevirt-bot commented 4 months ago

@brianmcarey: This request has been marked as suitable for new contributors.

Guidelines

For more details on the requirements of such an issue, please see here and ensure that they are met.

If this request no longer meets these requirements, the label can be removed by commenting with the /remove-good-first-issue command.

anishbista60 commented 3 months ago

/assign

kubevirt-bot commented 2 weeks ago

Issues go stale after 90d of inactivity. Mark the issue as fresh with /remove-lifecycle stale. Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

/lifecycle stale

dhiller commented 1 week ago

/lifecycle frozen

dhiller commented 1 week ago

@anishbista60 would you want to work on this still?

@machadovilaca would someone from your SIG be able to help out here?

anishbista60 commented 1 week ago

@dhiller Yes, I still want to work on this. I was working on another issue, which has now been merged, so I'll start on this one. I would be grateful for support from any member.

anishbista60 commented 1 week ago

@brianmcarey Do I need to update the manifests of the stack above to match the upstream manifests at the link below? https://github.com/prometheus-operator/kube-prometheus/tree/main/manifests

brianmcarey commented 1 week ago

> @brianmcarey Do I need to update the manifests of the stack above to match the upstream manifests at the link below? https://github.com/prometheus-operator/kube-prometheus/tree/main/manifests

Yes, I think that should be it. I think you only need to focus on the manifests in the latest provider, k8s-1.31.
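To get a rough idea of how far the provider's manifests have drifted from upstream, one option is to compare a local checkout of the provider's prometheus manifests directory against a checkout of the kube-prometheus `manifests` directory. The sketch below is only an illustration, not part of kubevirtci: the function name `compare_manifest_dirs` is hypothetical, and it assumes both directories are flat and that files can be matched by name, which may not hold for every manifest in the stack.

```python
import filecmp
from pathlib import Path


def compare_manifest_dirs(local_dir, upstream_dir):
    """Classify manifest files, matched by file name, between two directories.

    Hypothetical helper: assumes both directories are flat (no recursion)
    and that a file with the same name in both places is the same manifest.
    """
    local = {p.name for p in Path(local_dir).iterdir() if p.is_file()}
    upstream = {p.name for p in Path(upstream_dir).iterdir() if p.is_file()}
    common = sorted(local & upstream)

    # shallow=False forces a byte-by-byte content comparison instead of
    # relying on os.stat() signatures (size and modification time).
    _match, mismatch, errors = filecmp.cmpfiles(
        local_dir, upstream_dir, common, shallow=False
    )

    return {
        # Files present in both places whose contents differ (or could not be read).
        "changed": sorted(mismatch + errors),
        # Files that exist only in the local provider directory.
        "local_only": sorted(local - upstream),
        # Files that exist only in the upstream kube-prometheus checkout.
        "upstream_only": sorted(upstream - local),
    }
```

Calling `compare_manifest_dirs(<provider prometheus dir>, <kube-prometheus manifests dir>)` returns three sorted lists of file names. The `changed` list is a starting point for review rather than a list of files to overwrite, since local customizations may be intentional.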