coder / observability

Creative Commons Zero v1.0 Universal

Epic: Bundled Observability #1

Open dannykopping opened 4 months ago

dannykopping commented 4 months ago

This is the holding issue for the set of improvements we want to make based on RFC: Bundled Observability.

Goal: produce a separate Helm chart that can be installed with a single script to observe a Coder deployment. Each sub-chart should contain at least one dashboard, alert, and runbook covering its own functional requirements (e.g. the Grafana installation should have its own dashboard, alert, and runbook so that operators can observe it). The Coder deployment itself should be covered by several dashboards, alerts, and runbooks, and we should collect all applicable telemetry signals (metrics, logs, traces, profiles).
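To make the "at least one dashboard, alert, and runbook per sub-chart" rule concrete, here is a minimal sketch of how such a coverage check might look. It is purely illustrative: the function name `missing_artifacts`, the inventory layout, and the artifact names in the example are all assumptions, not part of the RFC or the chart.

```python
# Hypothetical coverage check: every component must ship at least one
# dashboard, one alert, and one runbook. Names and layout are assumptions.
REQUIRED_ARTIFACTS = ("dashboard", "alert", "runbook")


def missing_artifacts(components):
    """Return {component: [missing artifact kinds]} for incomplete components."""
    gaps = {}
    for name, artifacts in components.items():
        # A kind counts as missing if it is absent or its list is empty.
        missing = [kind for kind in REQUIRED_ARTIFACTS if not artifacts.get(kind)]
        if missing:
            gaps[name] = missing
    return gaps


# Illustrative inventory only; these artifact names are made up.
inventory = {
    "grafana": {
        "dashboard": ["grafana-health"],
        "alert": ["GrafanaDown"],
        "runbook": ["grafana.md"],
    },
    "loki": {"dashboard": ["loki-ingest"], "alert": [], "runbook": []},
}

print(missing_artifacts(inventory))  # {'loki': ['alert', 'runbook']}
```

A check along these lines could run in CI so that a new sub-chart cannot be merged without its own observability coverage.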

Initial Requirements:

Eventual Requirements:

dannykopping commented 3 months ago

Adhoc task list:

General

Prometheus / Alertmanager

Grafana

Loki

bpmct commented 3 months ago

Can we add this as a step in our Kubernetes install docs as Recommended, but optional? https://coder.com/docs/v2/latest/install/kubernetes

dannykopping commented 3 months ago

> Can we add this as a step in our Kubernetes install docs as Recommended, but optional? https://coder.com/docs/v2/latest/install/kubernetes

Absolutely, will add an item