Closed orfeas-k closed 5 months ago
Thank you for reporting us your feedback!
The internal ticket has been created: https://warthogs.atlassian.net/browse/KF-5487.
This message was autogenerated
Code doesn't implement any metrics although there are some references in its go.mod and .sum files.
Code doesn't implement any metrics.
Code doesn't implement any prometheus metrics.
Code doesn't implement any metrics.
Pipeline steps don't expose metrics by default. Feature requests:
Code doesn't implement anything related to metrics https://github.com/kubeflow/pipelines/tree/master/backend/metadata_writer
Code doesn't implement any metrics. The only reference to metrics
is about "metrics" provided from the application for exposing artifacts in the UI.
Code doesn't implement any metrics.
Code doesn't implement any prometheus metrics.
Code doesn't implement any prometheus metrics.
Code doesn't implement any prometheus metrics.
Code doesn't implement any prometheus metrics.
up
metric of those charms)/metrics
by default) although our charm doesn't patch the service to expose this endpoint.2024-04-02T08:03:21.120Z [serve] > kubeflow-centraldashboard@0.0.2 serve
2024-04-02T08:03:21.120Z [serve] > node dist/server.js
2024-04-02T08:03:21.120Z [serve]
2024-04-02T08:03:23.555Z [serve] Initializing Kubernetes configuration
2024-04-02T08:03:23.611Z [serve] Unable to fetch Application information: 404 page not found
2024-04-02T08:03:23.611Z [serve]
2024-04-02T08:03:23.637Z [serve] "other" is not a supported platform for Metrics
2024-04-02T08:03:23.638Z [serve] Using Profiles service at http://kubeflow-profiles.kubeflow:8081/kfam
2024-04-02T08:03:23.645Z [serve] Server listening on port http://localhost:8082 (in production mode)
<pod-ip>:80802/api/metrics
returns
{"error":"Operation not supported"}%
Thus, metrics are not available from upstream in our case.
There isn't an upstream app for this charm.
Code doesn't implement any metrics.
Code doesn't implement any metrics.
There are some references to prometheus packages in go.mod and .sum files but nothing is implemented in its code.
k exec envoy-operator-0 -n kubeflow -- curl pvcviewer-operator.kubeflow.svc.cluster.local:8443/metrics
Client sent an HTTP request to an HTTPS server.
This is probably due to this note in upstream manifests. We should probably remove this from our charm command (although I didn't try and need to verify this). It could also be that we have not included the Prometheus manifests in our case.
Code doesn't implement any metrics.
Regarding all the K8s Controllers from kubeflow/kubeflow
(notebooks, profiles, tensorboards) they will get some quite useful metrics by default because of controller-runtime
golang package, that comes with Kubebuilder
https://book.kubebuilder.io/reference/metrics-reference
Those are perfect for capturing if the controllers are working as expected, and it's great it will be handled by default.
In order for this to happen though, someone upstream will need to bump the controller-runtime
package from 0.11 to 0.16.3
Context
In order to evaluate better the work for integrating CKF charms with Observability charms, we need to investigate which CKF charms provide metrics by default. The state right now with charms alerts is documented in https://github.com/canonical/bundle-kubeflow/issues/837.
At the same time, we will document also the metrics that those charms provide.
What needs to get done
Document for every charm:
Definition of Done
We know for all CKF charms what they expose.