Closed computate closed 1 day ago
See the state of the nerc-ocp-test cluster here.
The gatekeeper-controller-manager Deployment pods in the gatekeeper-system
namespace are failing with an ImagePullBackOff: Back-off pulling image "openpolicyagent/gatekeeper:v3.17.1"
error.
The issue is docker rate limiting these image pulls Failed to pull image "openpolicyagent/gatekeeper:v3.17.1": reading manifest v3.17.1 in docker.io/openpolicyagent/gatekeeper: toomanyrequests: You have reached your pull rate limit. You may increase the limit by authenticating and upgrading: https://www.docker.com/increase-rate-limit
@larsks or @jtriley is there a way to configure a default docker image pull secret for the cluster for someone's docker credentials, or should I temporarily add mine to fix this issue? Open to ideas of best practices for this operator image on Docker.
@computate you can absolutely set a default pull secret for the cluster. However, I think best practice is for each project to set up their own default pull secret in their project namespace, rather than relying on the cluster default.
@larsks do you know who is the user of the Gatekeeper application in the test cluster?
@computate we use Gatekeeper to manage policies for the rhods-notebooks namespace on the production cluster. @IsaiahStapleton may have used it on the test cluster in order to validate policies before deploying them in production.
@larsks thanks for your help resolving these issues on the test cluster.
$ k get clusterversion
NAME VERSION AVAILABLE PROGRESSING SINCE STATUS
version 4.15.28 True False 81d Cluster version is 4.15.28
Motivation
The
nerc-ocp-test
cluster operator network is in a degraded state.Completion Criteria
Make sure all the cluster operators are
Available
in thenerc-ocp-test
cluster.Description
Error while updating operator configuration: could not apply (/v1, Kind=Namespace) /openshift-network-node-identity: failed to apply / update (/v1, Kind=Namespace) /openshift-network-node-identity: Internal error occurred: failed calling webhook "check-ignore-label.gatekeeper.sh": failed to call webhook: Post "[https://gatekeeper-webhook-service.gatekeeper-system.svc:443/v1/admitlabel?timeout=3s](https://gatekeeper-webhook-service.gatekeeper-system.svc/v1/admitlabel?timeout=3s)": no endpoints available for service "gatekeeper-webhook-service"
Completion dates
Desired - 2024-12-04