cockroachdb / cockroach

CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.
https://www.cockroachlabs.com

Grafana dashboard for "Backup Schedule" #57943

Open mohsenasm opened 3 years ago

mohsenasm commented 3 years ago

Is your feature request related to a problem? Please describe. There are some helpful Grafana dashboards provided in this link. It would be nice if you provided another dashboard for managing backup schedules as well. Of course, this is not an urgent feature request and I can create a dashboard myself, but a dashboard designed by CockroachDB would be far better.

Describe the solution you'd like A Grafana/Prometheus dashboard for managing backup schedules, built on metrics such as schedules_BACKUP_started and schedules_BACKUP_failed.

Describe alternatives you've considered Creating a dashboard myself.

Additional context None.

Jira issue: CRDB-3456

blathers-crl[bot] commented 3 years ago

Hello, I am Blathers. I am here to help you get the issue triaged.

I have CC'd a few people who may be able to assist you:

If we have not gotten back to your issue within a few business days, you can try the following:

:owl: Hoot! I am a Blathers, a bot for CockroachDB. My owner is otan.

mohsenasm commented 3 years ago

Hi again, I have tried to create a Grafana dashboard myself. Although the backup schedule is working fine, it seems that neither the schedules_BACKUP_started nor the schedules_BACKUP_failed metric appears in the output of http://localhost:8080/_status/vars. These metrics are not in our Prometheus either.

Version: v20.2.3
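
For anyone who wants to repeat the check described above, here is a minimal Python sketch that looks for the two metric names in the text returned by /_status/vars. It assumes the endpoint serves the Prometheus text exposition format over plain HTTP without authentication (as in the URL above). The helper names metric_names, missing_metrics, and fetch_vars are hypothetical and not part of CockroachDB.

```python
import urllib.request

# Metric names taken from this issue.
EXPECTED = ("schedules_BACKUP_started", "schedules_BACKUP_failed")


def metric_names(exposition_text):
    """Return the set of metric names found in Prometheus text-format output."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and "# HELP" / "# TYPE" comments
        # A sample line looks like "name{labels} value" or "name value".
        tokens = line.split("{", 1)[0].split(None, 1)
        if tokens:
            names.add(tokens[0])
    return names


def missing_metrics(exposition_text, expected=EXPECTED):
    """Return the expected metric names that do not appear in the output."""
    present = metric_names(exposition_text)
    return [name for name in expected if name not in present]


def fetch_vars(url="http://localhost:8080/_status/vars", timeout=5):
    """Fetch the raw metrics text (assumes an insecure, locally reachable node)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Against a running node, `print(missing_metrics(fetch_vars()))` would list whichever of the two metrics is absent; an empty list means both are exported.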

mwang1026 commented 3 years ago

We identified a bug that has since been fixed and backported. The fix will be part of the v20.2.4 release: https://github.com/cockroachdb/cockroach/pull/57512

mohsenasm commented 3 years ago

Thanks!

github-actions[bot] commented 1 year ago

We have marked this issue as stale because it has been inactive for 18 months. If this issue is still relevant, removing the stale label or adding a comment will keep it active. Otherwise, we'll close it in 10 days to keep the issue queue tidy. Thank you for your contribution to CockroachDB!

mohsenasm commented 1 year ago

As of release v20.2.4, we have the schedules_BACKUP_started and schedules_BACKUP_failed metrics, so I am closing this issue. Thanks for the bug fix.

mohsenasm commented 1 year ago

Actually, I checked again. I have two clusters of three servers each, all running v20.2.4, but only two of these six servers return the schedules_BACKUP_started and schedules_BACKUP_failed metrics in response to http://localhost:8080/_status/vars.
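
To make the per-server comparison above reproducible, here is a small Python sketch that reports which metrics are exported by some nodes but not by others. It assumes the set of metric names has already been collected from each node's /_status/vars output. The function name inconsistent_metrics and the node labels in the usage example are hypothetical.

```python
def inconsistent_metrics(names_by_node):
    """Map each metric exported by some nodes but not all to the nodes lacking it.

    names_by_node: dict of node label -> set of metric names seen on that node.
    """
    # Every metric name seen on at least one node.
    all_names = set().union(*names_by_node.values())
    report = {}
    for metric in sorted(all_names):
        lacking = sorted(
            node for node, names in names_by_node.items() if metric not in names
        )
        if lacking:
            report[metric] = lacking
    return report
```

For example, `inconsistent_metrics({"node1": {"schedules_BACKUP_started"}, "node2": set()})` returns `{"schedules_BACKUP_started": ["node2"]}`, which shows at a glance which servers are missing the schedule metrics.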