mesosphere-backup / ansible-dcos

[DEPRECATED] Please consider using the Ansible Roles for DC/OS maintained by the Mesosphere SRE team
https://github.com/dcos/dcos-ansible
Apache License 2.0
37 stars 31 forks source link

problem when installing Mesosphere DC/OS in Bare Metal #25

Open khalilbenaissa opened 6 years ago

khalilbenaissa commented 6 years ago

Hello, I installed Mesosphere DC/OS in Bare Metal with 1 master, 1 agent, 1 public agent and 1 boot. The script Ansible ran very good but i couldn't access to my the dashboard of Mesosphere. I got an error with the mesos-dns.service : sept. 17 13:36:17 gzlaac01 mesos-dns[14638]: 2018/09/17 13:36:17 Failed to connect to zk-5.zk:2181: dial tcp: lookup zk-5.zk on 198.51.100.3:53: no such host sept. 17 13:36:17 gzlaac01 mesos-dns[14638]: 2018/09/17 13:36:17 Failed to connect to zk-3.zk:2181: dial tcp: lookup zk-3.zk on 198.51.100.3:53: no such host sept. 17 13:36:18 gzlaac01 mesos-dns[14638]: 2018/09/17 13:36:18 Failed to connect to zk-4.zk:2181: dial tcp: i/o timeout sept. 17 13:36:18 gzlaac01 mesos-dns[14638]: 2018/09/17 13:36:18 Failed to connect to zk-2.zk:2181: dial tcp: lookup zk-2.zk on 198.51.100.3:53: no such host sept. 17 13:36:19 gzlaac01 mesos-dns[14638]: 2018/09/17 13:36:19 Failed to connect to zk-1.zk:2181: dial tcp: lookup zk-1.zk on 198.51.100.3:53: no such host sept. 17 13:36:19 gzlaac01 mesos-dns[14638]: 2018/09/17 13:36:19 Failed to connect to zk-5.zk:2181: dial tcp: lookup zk-5.zk on 198.51.100.3:53: no such host sept. 17 13:36:19 gzlaac01 mesos-dns[14638]: 2018/09/17 13:36:19 Failed to connect to zk-3.zk:2181: dial tcp: lookup zk-3.zk on 198.51.100.3:53: no such host sept. 17 13:36:19 gzlaac01 mesos-dns[14638]: 2018/09/17 13:36:19 Failed to connect to zk-4.zk:2181: dial tcp: lookup zk-4.zk on 198.51.100.3:53: no such host sept. 17 13:36:19 gzlaac01 mesos-dns[14638]: 2018/09/17 13:36:19 Failed to connect to zk-2.zk:2181: dial tcp: lookup zk-2.zk on 198.51.100.3:53: no such host sept. 17 13:36:20 gzlaac01 mesos-dns[14638]: 2018/09/17 13:36:20 Failed to connect to zk-1.zk:2181: dial tcp: lookup zk-1.zk on 198.51.100.3:53: no such host

Can you help me please ?

dnguyen-fnx commented 6 years ago

Hi, according to the documentation, on the master node : $> sudo systemctl disable dnsmasq && sudo systemctl stop dnsmasq $> systemctl restart dcos-mesos-dns.service

Explanation : "Ensure that firewalls and any other connection-filtering mechanisms are not interfering with cluster component communications. TCP, UDP, and ICMP must be permitted.

Ensure that services that bind to port 53, which is required by DNS Forwarder (dcos-spartan.service), are disabled and stopped."

khalilbenaissa commented 6 years ago

thank you.

But you can also change the dcos version from 1.11.4 to 1.11.0, it works too.