giantswarm / roadmap

Giant Swarm Product Roadmap
https://github.com/orgs/giantswarm/projects/273
Apache License 2.0

Setup Mimir on CAPA installation #3039

Closed TheoBrigitte closed 4 days ago

TheoBrigitte commented 7 months ago

The goal here is to replace the current monitoring on a testing CAPA installation and have Mimir in place to send alerts and visualize data via Grafana.

This solution should be fully automated, so we can replicate this setup on multiple installations by using a toggle flag which would trigger the deployment and configuration of this monitoring solution. Automation for the new monitoring setup should no be added into the existing prometheus-meta-operator.

Here are some goals to reach:

### Mimir Setup
- [ ] https://github.com/giantswarm/roadmap/issues/3087
- [ ] https://github.com/giantswarm/roadmap/issues/3088
- [ ] https://github.com/giantswarm/roadmap/issues/3089
- [ ] https://github.com/giantswarm/roadmap/issues/3152
- [ ] https://github.com/giantswarm/roadmap/issues/3158
- [ ] https://github.com/giantswarm/roadmap/issues/3162
- [ ] https://github.com/giantswarm/roadmap/issues/3127
- [ ] https://github.com/giantswarm/roadmap/issues/3157
- [ ] https://github.com/giantswarm/roadmap/issues/3163
- [ ] https://github.com/giantswarm/roadmap/issues/3298
- [ ] https://github.com/giantswarm/roadmap/issues/3160
- [ ] https://github.com/giantswarm/roadmap/issues/3159
- [ ] https://github.com/giantswarm/roadmap/issues/3301
- [ ] https://github.com/giantswarm/giantswarm/issues/30281
- [ ] https://github.com/giantswarm/roadmap/issues/3161
- [ ] https://github.com/giantswarm/giantswarm/issues/30090
- [ ] https://github.com/giantswarm/roadmap/issues/3377
- [ ] https://github.com/giantswarm/giantswarm/issues/30218
- [ ] https://github.com/giantswarm/roadmap/issues/3379
- [ ] https://github.com/giantswarm/roadmap/issues/3217
- [ ] https://github.com/giantswarm/giantswarm/issues/30310
- [ ] https://github.com/giantswarm/giantswarm/issues/30836
- [ ] https://github.com/giantswarm/giantswarm/issues/30834
- [ ] https://github.com/giantswarm/giantswarm/issues/30835
- [ ] https://github.com/giantswarm/giantswarm/issues/30837
- [ ] https://github.com/giantswarm/giantswarm/issues/30887
- [ ] https://github.com/giantswarm/giantswarm/issues/30833
- [ ] https://github.com/giantswarm/giantswarm/issues/30758
- [ ] https://github.com/giantswarm/giantswarm/issues/30963

After Mimir is successfully set up on one installation, we can start onboarding our internal customers and checking the new system before rolling it out everywhere and cleaning up the old system.

### Migration and Cleanup
- [ ] https://github.com/giantswarm/giantswarm/issues/30365
- [ ] https://github.com/giantswarm/giantswarm/issues/30982
- [ ] https://github.com/giantswarm/roadmap/issues/3510
- [ ] https://github.com/giantswarm/roadmap/issues/3090
- [ ] https://github.com/giantswarm/giantswarm/issues/30839
- [ ] https://github.com/giantswarm/roadmap/issues/3315
- [ ] https://github.com/giantswarm/roadmap/issues/3314
- [ ] https://github.com/giantswarm/giantswarm/issues/31094
- [ ] https://github.com/giantswarm/roadmap/issues/3505
- [ ] https://github.com/giantswarm/roadmap/issues/3218
- [ ] https://github.com/giantswarm/roadmap/issues/3513
- [ ] https://github.com/giantswarm/giantswarm/issues/31111

QuentinBisson commented 2 months ago

To unblock Mimir:

QuentinBisson commented 2 months ago

cc @Rotfuks as we still need those items :)

Rotfuks commented 2 months ago

Here's the list of stories discussed from our onsite:

Need to do later:
Once we have alert review finished:

Once we have Mimir rolled out:

Rotfuks commented 4 days ago

This is done, hurray! Great job @giantswarm/team-atlas !!!! :confetti_ball: