nerc-project / operations

Issues related to the operation of the NERC OpenShift environment

Researching Contextual Event Integration in Observability Dashboards #462

Open schwesig opened 2 months ago

schwesig commented 2 months ago

Motivation

Although our current and planned observability infrastructure provides comprehensive metrics, logs, and traces, it lacks the ability to correlate these data points with specific system events. Without event context, it is hard to link system changes, such as hardware upgrades or software deployments, directly to their impact on system reliability. Adding context around these events could significantly improve our understanding and analysis capabilities.

Objective

Explore methodologies and technologies for integrating contextual event markers into our observability tooling (dashboards). Investigate both the technical feasibility and the potential value of embedding event context (e.g., hardware changes, software updates, configuration modifications) directly within our database and monitoring tools. At a minimum, we would like a structured changelog that can be combined with the collected metrics.
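To make the "structured changelog combined with metrics" idea concrete, here is a minimal sketch in Python. It is only an illustration under assumptions: the `ChangeEvent` fields, the `impact` helper, and the 30-minute window are all hypothetical and not part of any existing NERC tooling. It shows one changelog entry being joined with timestamped metric samples by comparing the average value before and after the event.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import mean
from typing import List, Optional, Tuple

# A metric sample is a (timestamp, value) pair.
Sample = Tuple[datetime, float]


@dataclass(frozen=True)
class ChangeEvent:
    """One structured changelog entry (field names are hypothetical)."""
    timestamp: datetime
    kind: str         # e.g. "hardware", "software", "config"
    target: str       # the node, cluster, or service that changed
    description: str


def mean_in_window(samples: List[Sample], start: datetime,
                   end: datetime) -> Optional[float]:
    """Average of all sample values with start <= timestamp < end."""
    values = [value for ts, value in samples if start <= ts < end]
    return mean(values) if values else None


def impact(event: ChangeEvent, samples: List[Sample],
           window: timedelta = timedelta(minutes=30)
           ) -> Tuple[Optional[float], Optional[float]]:
    """Return (mean before, mean after) the event within the given window."""
    before = mean_in_window(samples, event.timestamp - window, event.timestamp)
    after = mean_in_window(samples, event.timestamp, event.timestamp + window)
    return before, after
```

Calling `impact(event, samples)` for each changelog entry would give a rough before/after comparison per change. In practice the same event records could also be rendered as markers on a dashboard, which is a separate question from how they are stored.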

Requirements

Completion Criteria

Description

The goal of this issue is to find ways to make our observability insightful and actionable. We aim to improve our ability to perform root cause analysis and to improve system reliability proactively.

Considerations

Discussions

schwesig commented 2 months ago

Idea: Event-Driven Ansible, see https://www.redhat.com/en/technologies/management/ansible/event-driven-ansible

computate commented 2 months ago

@schwesig Fortunately, the AAP Operator is installed in NERC! See https://github.com/OCP-on-NERC/nerc-ocp-config/pull/234

computate commented 2 months ago

I have worked on other event-driven automation projects in Red Hat Financial Services Consulting that work perfectly with AAP. See three articles about our Event-Driven Automation Portfolio Architecture here:

schwesig commented 1 month ago

https://community.grafana.com/t/how-to-create-vertical-line-in-histogram-chart/82982

https://dev.to/aleksk1ng/java-quarkus-cqrs-and-eventsourcing-microservice-example-2p8h