Closed madiot closed 6 years ago
Without actually doing a full test, I would say that yes, your topology looks strange. Normally you should have 3 or 5 ZooKeepers and 3 JournalNodes in a cluster. Your setup tries to run 4 x ZooKeeper and 4 x JournalNode. I have no idea how Ambari behaves in this scenario, but nothing good I assume, looking at your output and errors.
The hdp-worker-zk group is a special role for a cluster with 2 master nodes, designed to be used by exactly 1 node: the one worker node that runs the 2 additional master services that are required, ZooKeeper and JournalNode.
Just re-install your cluster with 1 x hdp-worker-zk and 3 x hdp-worker.
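For what it's worth, the "3 or 5" rule is just majority arithmetic: both ZooKeeper and the JournalNode quorum need a strict majority of members to be reachable. A minimal sketch of that arithmetic follows; the function names are made up for illustration and are not part of Ambari or the playbooks.

```python
def majority(n: int) -> int:
    """Smallest number of members that forms a strict majority of n."""
    return n // 2 + 1


def tolerated_failures(n: int) -> int:
    """How many members can be lost while a majority is still reachable."""
    return n - majority(n)


if __name__ == "__main__":
    # Compare the usual odd sizes with the 4-member layout from this issue.
    for n in (3, 4, 5):
        print(
            f"{n} members: majority={majority(n)}, "
            f"tolerated failures={tolerated_failures(n)}"
        )
```

With 4 members the quorum still survives only one failure, the same as with 3, so the fourth ZooKeeper or JournalNode adds no resilience.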
Do you still have this problem, @madiot? Can we close this?
Hi,
I've walked through all the playbooks successfully, with a topology of 2 NN and 4 DN (2 'hdp-worker-zk' and 2 'hdp-worker'), as per below.
Now, when I try to start all services in Ambari, the NameNodes are not starting, and the log shows:
Has anyone encountered this issue? I see that /hadoop/hdfs/namenode is completely empty.
What would be the recommended course of action to get both NameNodes up and running? Should one be started with 'hdfs namenode -bootstrapStandby' and then the other one formatted?
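As far as I know from the Apache Hadoop HA (Quorum Journal Manager) documentation, on a fresh cluster it is the other way around: one NameNode is formatted first, and the second one is then bootstrapped from it. The sketch below only prints that usual order and runs nothing. The labels "nn1" and "nn2" are placeholders, the commands are normally run as the hdfs user, and how Ambari itself sequences these steps is an assumption I have not verified. Note that 'hdfs namenode -format' wipes existing metadata, so it only makes sense while the NameNode directories are still empty.

```python
# Usual bring-up order for a *fresh* HA NameNode pair backed by JournalNodes.
# "nn1"/"nn2" are placeholder labels, not hostnames taken from this cluster.
BOOTSTRAP_STEPS = [
    ("zookeeper + journalnode hosts", "make sure ZooKeeper and a majority of JournalNodes are running"),
    ("nn1", "hdfs namenode -format"),            # destructive: fresh clusters only
    ("nn1", "hdfs zkfc -formatZK"),              # initialise the HA state in ZooKeeper
    ("nn1", "start the NameNode service"),
    ("nn2", "hdfs namenode -bootstrapStandby"),  # copies metadata from the running nn1
    ("nn2", "start the NameNode service"),
]


def render(steps):
    """Turn (where, what) pairs into numbered, human-readable lines."""
    return [f"{i}. [{where}] {what}" for i, (where, what) in enumerate(steps, 1)]


if __name__ == "__main__":
    print("\n".join(render(BOOTSTRAP_STEPS)))
```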
The Hadoop version used: Hadoop 3.1.1.3.0.1.0-187. 'hdfs getconf -namenodes' returns the expected 2 nodes.
One last thing. In core-site, ha.zookeeper.quorum points to 4 nodes (2 NN and 2 DN) that should be listening on port 2181. When I check each of these nodes, 1 of the DNs is not listening. That same host, which was supposed to be a ZooKeeper server, is apparently also missing the JournalNode. Could this be related to a NameNode formatting issue? Should I clean up the HDFS config by removing the missing node from the following advanced configs?
If so, what are the suggested actions to get the NameNodes formatted and the cluster up and running?
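A quick way to re-check which members of ha.zookeeper.quorum actually accept connections on 2181 is a plain TCP connect from any node. This is a minimal sketch: the hostnames are hypothetical stand-ins for the 4 quorum entries, and a successful connect only shows that something is listening on the port, not that ZooKeeper is healthy.

```python
import socket


def is_listening(host: str, port: int = 2181, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS resolution failures.
        return False


def check_quorum(hosts, port: int = 2181):
    """Map each host to whether it accepts connections on the given port."""
    return {host: is_listening(host, port) for host in hosts}


if __name__ == "__main__":
    # Hypothetical hostnames; replace with the entries from ha.zookeeper.quorum.
    quorum_hosts = [
        "nn1.example.com",
        "nn2.example.com",
        "dn1.example.com",
        "dn2.example.com",
    ]
    for host, up in check_quorum(quorum_hosts).items():
        print(f"{host}:2181 -> {'listening' if up else 'NOT listening'}")
```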
For reference, here are the relevant snippets of the host_groups template definition in the playbook/group_vars/all file: