rancher / os

Tiny Linux distro that runs the entire OS as Docker containers
https://rancher.com/docs/os/v1.x/en/
Apache License 2.0
6.44k stars 655 forks source link

cattle-cluster-agent fails after the restart of RancherOS nodes #2853

Open TheAifam5 opened 5 years ago

TheAifam5 commented 5 years ago

RancherOS Version: (ros os version) v1.5.3

Where are you running RancherOS? (docker-machine, AWS, GCE, baremetal, etc.) baremetal

Hey!

I followed the documentation https://rancher.com/docs/rancher/v2.x/en/installation/ha/ and after the restart, cattle-cluster-agent restarts itself all the time.

Redeployment of that workload fixes the issue temporarily, after the restart must be redeployed again.

Log:

INFO: Environment: CATTLE_ADDRESS=10.42.0.8 CATTLE_CA_CHECKSUM=88809bea5138e9e83757cb3cc5d8f73225cc3d81b599c81c7346276590c0e271 CATTLE_CLUSTER=true CATTLE_INTERNAL_ADDRESS= CATTLE_K8S_MANAGED=true CATTLE_NODE_NAME=cattle-cluster-agent-5b788677c6-mfmwb CATTLE_SERVER=https://rancher.docker.zz

INFO: Using resolv.conf: nameserver 10.43.0.10 search cattle-system.svc.cluster.local svc.cluster.local cluster.local options ndots:5

ERROR: https://rancher.docker.zz/ping is not accessible (Could not resolve host: rancher.docker.zz)

Regards, TheAifam5

niusmallnan commented 5 years ago

Just want to confirm, other OS have this problem? If so, you should file an issue to rancher/rancher.