Open n00bsi opened 1 month ago
Found some things:
oc describe clusteroperator/machine-config
oc delete pod node-ca-8566c -n openshift-image-registry
and all other node-* pods
oc get pods -n openshift-machine-config-operator
oc logs -f -n openshift-machine-config-operator machine-config-controller-645db999c6-xjsqs -c machine-config-controller
oc adm drain node1.domain.tld --ignore-daemonsets --force --delete-emptydir-data
https://www.neteye-blog.com/2023/08/debug-and-workarounds-for-a-stuck-update-on-openshift-4-13-6/
https://access.redhat.com/solutions/5317441
https://access.redhat.com/solutions/5598401
Now all Nodes have the same level:
Red Hat Enterprise Linux CoreOS 416.94.202409191851-0
$ oc get clusterversion
NAME VERSION AVAILABLE PROGRESSING SINCE STATUS
version 4.16.15 True False 112m Cluster version is 4.16.15
[provide a description of the issue]
Version
[provide output of the
openshift version
oroc version
command]Steps To Reproduce
Current Result
Update hat at 88%
Available: The deployment does not exist NodeCADaemonAvailable: The daemon set node-ca has available replicas ImagePrunerAvailable: Pruner CronJob has been created
$ oc get clusterversion NAME VERSION AVAILABLE PROGRESSING SINCE STATUS version 4.16.14 True True 6d Unable to apply 4.16.15: the cluster operator image-registry is not available
$ oc get clusteroperator image-registry NAME VERSION AVAILABLE PROGRESSING DEGRADED SINCE MESSAGE image-registry False True True 26m Available: The deployment does not exist...
$ oc get pvc ... ... ocs4registry Bound pvc-38960e2f-4c6b-450d-a5fe-c1a26714e496 1Gi RWX longhorn 162
...
$ oc get pods -n openshift-image-registry NAME READY STATUS RESTARTS AGE cluster-image-registry-operator-7c87776c4c-csz22 1/1 Running 0 38m node-ca-5c6gg 1/1 Running 0 38m node-ca-c492l 1/1 Running 0 38m node-ca-crzlc 1/1 Running 0 156m node-ca-dskf6 1/1 Running 0 38m node-ca-mpwjb 1/1 Running 0 38m node-ca-xmjbp 1/1 Running 0 38m