Closed joran-fonjallaz closed 1 month ago
providing more info.
the response on the controlplane's readiness probe at :10254/readyz
is a 404 page not found
. The container logs show that the contolplane fails to reach the dataplane on the admin API on port 8444
2024-05-19T07:28:04Z info setup Retrying kong admin api client call after error {"v": 0, "retries": "50/60", "error": "making HTTP request: Get \"https://10-52-2-37.dataplane-admin-kong-wgz7n-mpvzh.default.svc:8444/\": dial tcp: lookup 10-52-2-37.dataplane-admin-kong-wgz7n-mpvzh.default.svc on 10.17.176.10:53: no such host"}
the format of the URI looks wrong https://10-52-2-37.dataplane-admin-kong-wgz7n-mpvzh.default.svc:8444
, but maybe it's only the log format. I didn't dive into the operator's source code.
the response on the dataplane's readiness probe at :8100/status/ready
is a 503 Service Temporarily Unavailable
with body {"message":"no configuration available (empty configuration present)"}
which makes sense since the controlplane cannot reach the dataplane on its admin port, and thus configure it.
Hi @joran-fonjallaz,
Due to limitations in kube-dns
(which is used by default on GKE) the Admin API endpoints are unreachable using the service scoped dns names which is what ControlPlane
(KIC) uses by default (as defined by --gateway-discovery-dns-strategy
).
We have 2 issues tracking this https://github.com/Kong/gateway-operator/issues/179 and https://github.com/Kong/gateway-operator/issues/140 and a workaround which uses coredns instead: https://github.com/Kong/gateway-operator/issues/179#issuecomment-2071905771.
thank you @pmalek for getting back to me regarding this issue. Switching to CoreDNS is not an option for me. Do you have any idea if this issue will get solved for the native kube-dns
? If yes, any ticket I can track ?
I don't believe this is going to change for kube-dns anytime soon. I've created https://github.com/kubernetes/dns/issues/633 to track this feature request.
In the meantime I'm going to close this issue as it's already tracked under https://github.com/Kong/gateway-operator/issues/179 and https://github.com/Kong/gateway-operator/issues/140.
thanks again @pmalek ! So do I understand correctly, you mean that kong as no plan to support GKE for the gateway-operator any time soon ?
We do want to support GKE but as of now the only option is to use coredns instead of kube-dns.
When https://github.com/Kong/gateway-operator/issues/179 gets resolved we'll have a solution for GKE without the mentioned workaround.
following the official doc, KGO remains in a broken state, where the
controller-manager
fails with an error, and thecontrolplane
anddataplane
deployment get ready.Steps to reproduce. Create a new GKE cluster. No special config.
the
gateway-operator
(containermanager
) throw a few errors such aswith the stack trace
or such messages
the
dataplane
never becomes ready withReadiness probe failed: HTTP probe failed with statuscode: 503
and similarly for the
controlplane
withReadiness probe failed: HTTP probe failed with statuscode: 404
.I've tried also playing with the
values.yaml
, but after a lot of hours, I resolve to asking for help. I don't manage to even make it work by following you doc, and on a fresh GKE cluster.What am I missing ? Any pointer would be greatly appreciated ! Many thanks