This is a placeholder RFD to limit the scope of OpenCHAMI for monitoring responsibility.
Each site that will be deploying OpenCHAMI has their own Logging/Metrics infrastructure that supports their needs. Some expect prometheus-style metrics exports. Many separate system metrics and logging from job or workload based monitoring.
The OpenCHAMI TSC needs to describe the limit of responsibility of the OpenCHAMI services. This includes logging and tracing of the microservices involved in delivering service, but doesn't include the metrics and monitoring of the head node and/or vms.
This is a placeholder RFD to limit the scope of OpenCHAMI for monitoring responsibility.
Each site that will be deploying OpenCHAMI has their own Logging/Metrics infrastructure that supports their needs. Some expect prometheus-style metrics exports. Many separate system metrics and logging from job or workload based monitoring.
The OpenCHAMI TSC needs to describe the limit of responsibility of the OpenCHAMI services. This includes logging and tracing of the microservices involved in delivering service, but doesn't include the metrics and monitoring of the head node and/or vms.