openshift-pipelines / pipeline-service

SaaS for Tekton Pipelines
Apache License 2.0
23 stars 44 forks source link

SRVKP-4532: factor k8s throttling into task panel #1031

Closed gabemontero closed 1 month ago

gabemontero commented 1 month ago

@divyansh42 FYI

based on my monitoring of prod-rh01 via OCP metrics panel k8s throttling still correlates with the taskrun_pod_create metric so I'm filtering those taskruns from the total

the throttle_by_quota metrics are part of your SRVKP-4422 validation effort, hence the cross referernce

rh-pre-commit.version: 2.3.0 rh-pre-commit.check-secrets: ENABLED

gabemontero commented 1 month ago

yamllint error from earlier PR:

step-run-yamllint
yamllint 1.35.1
./operator/gitops/argocd/pipeline-service/openshift-pipelines/tekton-config.yaml
  199:7     error    wrong indentation: expected 4 but found 6  (indentation)

will address

gabemontero commented 1 month ago

yamllint error fixed

also widened the crash loop query @divyansh42 to include all tekton-* pods (will handle PAC separately) to cover the recent remote-resolver OOM/crashloopping.

@enarha @savitaashture FYI

gabemontero commented 1 month ago

I'm going to merge when the CI is clean so the OSP pruner is consistent for infra-deployements, but still take a look for education purposes @divyansh42

@enarha @savitaashture FYI