gocrane / crane

Crane is a FinOps Platform for Cloud Resource Analytics and Economics in Kubernetes clusters. The goal is not only to help users manage cloud costs more easily but also to ensure the quality of applications.
https://gocrane.io
Apache License 2.0

Dashboard fails to start: host not found in resolver #842

Closed. charnet1019 closed this issue 1 year ago

charnet1019 commented 1 year ago

The craned pod fails to start, and the logs show OOM messages.

Crane install command and craned container logs:

helm install crane -n kubesphere-monitoring-system --set craned.containerArgs.prometheus-address=http://prometheus-k8s.kubesphere-monitoring-system:9090 crane/crane
[root@ksmaster01 ~]# kubectl logs -f craned-6887b9cfdf-89w2d -n kubesphere-monitoring-system
Defaulted container "craned" out of: craned, dashboard
I0803 09:53:58.184621       1 nodelocal.go:25] Registering node local metrics collector cpu
I0803 09:53:58.184807       1 nodelocal.go:25] Registering node local metrics collector cpuLoad
I0803 09:53:58.184814       1 nodelocal.go:25] Registering node local metrics collector diskio
I0803 09:53:58.184818       1 nodelocal.go:25] Registering node local metrics collector memory
I0803 09:53:58.184838       1 nodelocal.go:25] Registering node local metrics collector netio
I0803 09:53:58.246256       1 predictor.go:121] predictors map[dsp:0xc0001ee340 percentile:0xc0002392c0]
I0803 09:53:58.249081       1 webhook.go:103] Succeed to setup autoscaling webhook
I0803 09:53:58.252269       1 manager.go:128] Recommendation Config updated.
I0803 09:53:58.252774       1 manager.go:84] Start watching /tmp/recommendation-config/config.yaml for update.
I0803 09:53:58.255554       1 predictor.go:141] predictor manager started, all predictors started
I0803 09:53:58.255653       1 prediction.go:151] predictor Periodic started
I0803 09:53:58.255693       1 prediction.go:302] predictor Percentile started
I0803 09:53:58.255704       1 leaderelection.go:248] attempting to acquire leader lease kubesphere-monitoring-system/craned...
I0803 09:53:58.335265       1 server.go:94] install crane api server middleware: log
I0803 09:53:58.335286       1 server.go:94] install crane api server middleware: cors
I0803 09:53:58.335292       1 server.go:94] install crane api server middleware: recovery
I0803 09:53:58.335856       1 server.go:149] Start to listening on http address: 0.0.0.0:8082
I0803 09:54:14.257621       1 leaderelection.go:258] successfully acquired lease kubesphere-monitoring-system/craned
I0803 09:54:14.360924       1 recorder.go:81] pod namespace kubesphere-logging-system, name elasticsearch-logging-data-2, container name elasticsearch, memory {{1610612736 0} {<nil>}  BinarySI},oom happens!
I0803 09:54:14.361362       1 recorder.go:81] pod namespace kubesphere-logging-system, name elasticsearch-logging-data-1, container name elasticsearch, memory {{1610612736 0} {<nil>}  BinarySI},oom happens!
I0803 09:54:14.362088       1 recorder.go:81] pod namespace kubesphere-logging-system, name elasticsearch-logging-data-0, container name elasticsearch, memory {{1610612736 0} {<nil>}  BinarySI},oom happens!
I0803 09:54:14.362577       1 recorder.go:81] pod namespace kubesphere-logging-system, name fluentbit-operator-b746b576c-6mf7g, container name fluentbit-operator, memory {{20971520 0} {<nil>} 20Mi BinarySI},oom happens!


charnet1019 commented 1 year ago

This is probably not a problem with crane itself; rather, the dashboard container is failing to start.

[root@ksmaster01 ~]# kubectl logs -f craned-6887b9cfdf-89w2d -n kubesphere-monitoring-system -c dashboard
2023/08/03 02:21:11 [emerg] 1#1: host not found in resolver "kube-dns.kube-system.svc.cluster.local" in /etc/nginx/conf.d/nginx.conf:46
nginx: [emerg] host not found in resolver "kube-dns.kube-system.svc.cluster.local" in /etc/nginx/conf.d/nginx.conf:46

charnet1019 commented 1 year ago

Resolved. The kube-dns host name in nginx-conf needs to be replaced with coredns.
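
For anyone hitting the same error, the substitution described above could be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `rewrite_resolver` is hypothetical, the sample `resolver` line is made up (line 46 of the real nginx.conf is not shown in this thread), and the commented cluster commands assume that `nginx-conf` is a ConfigMap in the release namespace and that the Deployment is named `craned`.

```shell
# Hypothetical helper: rewrite the DNS service host name in text read from stdin.
# Both service names are taken from the error message and the fix in this thread.
rewrite_resolver() {
  sed 's/kube-dns\.kube-system\.svc\.cluster\.local/coredns.kube-system.svc.cluster.local/g'
}

# Demonstration on a made-up resolver line (the real line 46 is not shown here).
echo 'resolver kube-dns.kube-system.svc.cluster.local valid=5s;' | rewrite_resolver

# Against a cluster, ASSUMING nginx-conf is a ConfigMap and the Deployment is
# named craned (both are assumptions, so the commands are left commented out):
#   kubectl get configmap nginx-conf -n kubesphere-monitoring-system -o yaml \
#     | rewrite_resolver | kubectl apply -f -
#   kubectl rollout restart deployment craned -n kubesphere-monitoring-system
```

Before changing anything, it may be worth confirming which DNS Service actually exists in the cluster with `kubectl get svc -n kube-system`, since the correct name depends on how the cluster was provisioned.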