Open njohnsn opened 8 months ago
Here is the log after I completely deleted and reinstalled k3s:
nmjoo@awx-ext:~/awx-operator$ kubectl get events --field-selector involvedObject.name=awx-operator-controller-manager-589cdd869b-hzlv7 -n awx
LAST SEEN TYPE REASON OBJECT MESSAGE
79s Normal Scheduled pod/awx-operator-controller-manager-589cdd869b-hzlv7 Successfully assigned awx/awx-operator-controller-manager-589cdd869b-hzlv7 to awx-ext
79s Normal Pulling pod/awx-operator-controller-manager-589cdd869b-hzlv7 Pulling image "gcr.io/kubebuilder/kube-rbac-proxy:v0.15.0"
77s Normal Pulled pod/awx-operator-controller-manager-589cdd869b-hzlv7 Successfully pulled image "gcr.io/kubebuilder/kube-rbac-proxy:v0.15.0" in 1.856s (1.856s including waiting)
77s Normal Created pod/awx-operator-controller-manager-589cdd869b-hzlv7 Created container kube-rbac-proxy
77s Normal Started pod/awx-operator-controller-manager-589cdd869b-hzlv7 Started container kube-rbac-proxy
70s Warning Failed pod/awx-operator-controller-manager-589cdd869b-hzlv7 Failed to pull image "quay.io/ansible/awx-operator:2.12.2": failed to pull and unpack image "quay.io/ansible/awx-operator:2.12.2": failed to extract layer sha256:86426b9e591db2cdd8eba8085aa38b705422152b49960696308d988f33f3d741: failed to unmount /var/lib/rancher/k3s/agent/containerd/tmpmounts/containerd-mount3984500326: failed to unmount target /var/lib/rancher/k3s/agent/containerd/tmpmounts/containerd-mount3984500326: device or resource busy: unknown
51s Warning Failed pod/awx-operator-controller-manager-589cdd869b-hzlv7 Failed to pull image "quay.io/ansible/awx-operator:2.12.2": failed to pull and unpack image "quay.io/ansible/awx-operator:2.12.2": failed to extract layer sha256:86426b9e591db2cdd8eba8085aa38b705422152b49960696308d988f33f3d741: failed to unmount /var/lib/rancher/k3s/agent/containerd/tmpmounts/containerd-mount2900992326: failed to unmount target /var/lib/rancher/k3s/agent/containerd/tmpmounts/containerd-mount2900992326: device or resource busy: unknown
24s Normal Pulling pod/awx-operator-controller-manager-589cdd869b-hzlv7 Pulling image "quay.io/ansible/awx-operator:2.12.2"
17s Warning Failed pod/awx-operator-controller-manager-589cdd869b-hzlv7 Failed to pull image "quay.io/ansible/awx-operator:2.12.2": failed to pull and unpack image "quay.io/ansible/awx-operator:2.12.2": failed to extract layer sha256:86426b9e591db2cdd8eba8085aa38b705422152b49960696308d988f33f3d741: failed to unmount /var/lib/rancher/k3s/agent/containerd/tmpmounts/containerd-mount3255831239: failed to unmount target /var/lib/rancher/k3s/agent/containerd/tmpmounts/containerd-mount3255831239: device or resource busy: unknown
17s Warning Failed pod/awx-operator-controller-manager-589cdd869b-hzlv7 Error: ErrImagePull
5s Warning FailedToRetrieveImagePullSecret pod/awx-operator-controller-manager-589cdd869b-hzlv7 Unable to retrieve some image pull secrets (redhat-operators-pull-secret); attempting to pull the image may not succeed.
5s Normal BackOff pod/awx-operator-controller-manager-589cdd869b-hzlv7 Back-off pulling image "quay.io/ansible/awx-operator:2.12.2"
5s Warning Failed pod/awx-operator-controller-manager-589cdd869b-hzlv7 Error: ImagePullBackOff
nmjoo@awx-ext:~/awx-operator$
This is not caused by the image pull secret. See:
Failed to pull image "quay.io/ansible/awx-operator:2.12.2": failed to pull and unpack image "quay.io/ansible/awx-operator:2.12.2": failed to extract layer sha256:86426b9e591db2cdd8eba8085aa38b705422152b49960696308d988f33f3d741: failed to unmount /var/lib/rancher/k3s/agent/containerd/tmpmounts/containerd-mount3255831239: failed to unmount target /var/lib/rancher/k3s/agent/containerd/tmpmounts/containerd-mount3255831239: device or resource busy: unknown
device or resource busy: unknown
This is likely caused by an issue with your k8s cluster's storage.
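To separate the actual pull failure from the pull-secret warning, the events can be filtered down to the messages that report a failed pull. This is a minimal sketch, assuming the default tabular output of `kubectl get events` shown above; the helper name `pull_failures` is hypothetical.

```shell
# Hypothetical helper: keep only event lines whose message reports a failed
# image pull. Reads `kubectl get events` output from stdin.
pull_failures() {
  grep -F 'Failed to pull image'
}

# Usage (assumes the `awx` namespace from the log above):
#   kubectl get events -n awx --field-selector type=Warning | pull_failures
```

In the log above, the lines that survive this filter all end in "device or resource busy", while the FailedToRetrieveImagePullSecret warning is dropped.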
Same here
Hey @kmf @njohnsn, as shown in https://github.com/ansible/awx-operator/issues/922 and as @kurokobo mentioned, I just created a dummy secret:
kubectl -n awx create secret docker-registry redhat-operators-pull-secret \
--docker-server=dummy.example.com \
--docker-username=dummy \
--docker-password=dummy
Then delete the pod to force it to restart. After that it works fine.
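The restart step can be sketched as follows. This assumes the `awx` namespace and the pod name from the event log earlier in this thread; the pod name will differ on other clusters. The `KUBECTL` variable and the function name `restart_operator_pod` are hypothetical and exist only so the commands can be dry-run with `echo`.

```shell
# KUBECTL is a hypothetical override; set KUBECTL=echo to print the commands
# instead of running them.
KUBECTL="${KUBECTL:-kubectl}"
NAMESPACE="awx"
# Pod name taken from the event log above; replace with your own pod name.
POD="awx-operator-controller-manager-589cdd869b-hzlv7"

restart_operator_pod() {
  # Confirm the dummy secret exists before restarting.
  $KUBECTL get secret redhat-operators-pull-secret -n "$NAMESPACE" || return 1
  # Delete the pod; its owning controller should create a replacement.
  $KUBECTL delete pod "$POD" -n "$NAMESPACE" || return 1
  # List pods to check the state of the replacement.
  $KUBECTL get pods -n "$NAMESPACE"
}

# Usage:
#   restart_operator_pod
```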
Please confirm the following
Bug Summary
Deploying awx-operator 2.12.2 fails with "Error: ImagePullBackOff" events. The reported reason is "Unable to retrieve some image pull secrets (redhat-operators-pull-secret);"
AWX Operator version
2.12.2
AWX version
Did not get far enough to determine
Kubernetes platform
other (please specify in additional information)
Kubernetes/Platform version
v1.28.7+k3s1
Modifications
no
Steps to reproduce
make deploy
Expected results
awx pods in running state
Actual results
Additional information
Operator Logs