Closed spiffxp closed 4 years ago
/remove-help /assign
@spiffxp I marked this as In Progress based on #18915 having merged.
When can we call this complete?
PR merged 2020-08-19, which is too far ago to be able to cleanly show before/after data using testgrid or prow.k8s.io
From a local grafana instance I have that runs queries against k8s-gubernator:build, it looks like the job runs more reliably and with a comparable failure rate under load.
A screenshot of triage from 2020-08-30 is early enough to pick up the before/after performance, and things look no worse that I can see. I'm guessing the spike of failures immediately after is unrelated, or has been corrected since then
CPU limit usage
CPU limit looks reasonable. As with other jobs, we need most of the CPU up front for building; in the case all the testing cpu usage happens on nodes spun up elsewhere. If we had a shared build we could take the CPU requirements way down.
Memory limit usage
Same story with memory limit usage
/close I think this is good enough
Apologies for falling behind on this one, it should have been in Monitoring, and I just didn't have time to sit still and check in on it until now.
@spiffxp: Closing this issue.
What should be cleaned up or changed:
This is part of #18550
To properly monitor the outcome of this, you should be a member of k8s-infra-prow-viewers@kubernetes.io. PR yourself into https://github.com/kubernetes/k8s.io/blob/master/groups/groups.yaml#L603-L628 if you're not a member.
Migrate pull-kubernetes-node-e2e to k8s-infra-prow-build by adding a
cluster: k8s-infra-prow-build
field to the job:NOTE: migrating this job is not as straightforward as some of the other #18550 issues, because we also need to:
--gcp-project=k8s-jkns-pr-node-e2e
with--gcp-project-type=gce-project
Once the PR has merged, note the date/time it merged. This will allow you to compare before/after behavior.
Things to watch for the job
pull-kubernetes-node-e2e
for 6hpull-kubernetes-node-e2e
for 6hThings to watch for the build cluster
Keep this open for at least 24h of weekday PR traffic. If everything continues to look good, then this can be closed.
/wg k8s-infra /sig testing /area jobs /help