Closed harshalk91 closed 7 years ago
We have created an issue in Pivotal Tracker to manage this:
https://www.pivotaltracker.com/story/show/138128541
The labels on this github issue will be updated when the story is started.
@wfernandes @dsabeti Can you guys help me out? I have been stuck on this for the last couple of weeks. :)
Thanks
@harshalk91 The consistency checker is responsible for ensuring etcd is not experiencing a split brain. It seems to be failing because there is one. Please see https://github.com/cloudfoundry-incubator/etcd-release#failure-recovery for how to resolve this.
@apoydence I tried the following commands:
monit stop etcd
rm -rf /var/vcap/store/etcd/*
monit start etcd
etcd_z1/51beb17f-30db-48b0-813a-04a0af79aae3:~# monit summary
The Monit daemon 5.2.5 uptime: 11h 51m
Process 'consul_agent' running
Process 'etcd' not monitored
Process 'etcd_consistency_checker' Execution failed
Process 'etcd_metrics_server' running
Process 'metron_agent' Does not exist
System 'system_localhost' running
I am still facing the same issue even after troubleshooting.
@dsabeti any thoughts as to what might be occurring here?
@ahevenor @dsabeti I have verified this on releases cf-247 through cf-251. The same issue occurs at the bosh deploy command. I even tried from scratch, redeploying the director and re-uploading the release and stemcells, but the issue still persists.
@harshalk91 It seems to me that the consistency checker is failing because it can't reach the etcd node. It seems that it can't reach the etcd node because the node is not yet alive:
| etcd_z1/0 (4290a801-094d-420d-96a1-cab8441cf549)* | failing | ...
| consul_agent | running | ...
| etcd | unknown | ... <=== HERE
| etcd_consistency_checker | failing | ...
Also here:
Process 'consul_agent' running
Process 'etcd' not monitored <==== HERE
Process 'etcd_consistency_checker' Execution failed
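A quick way to pick out entries like these is to filter the monit summary output for any process whose status is not "running". The following is only a rough sketch: the function name not_running is hypothetical, and it assumes the summary lines look exactly like the ones quoted above (Process 'name' status).

```shell
# Hypothetical helper: reads `monit summary` output on stdin and prints
# every process whose status is anything other than "running".
# Assumes lines of the form:  Process 'name' status text
not_running() {
  awk -F"'" '
    /^Process / {
      status = $3
      sub(/^[ \t]+/, "", status)   # trim the gap between name and status
      if (status != "running") print $2 ": " status
    }'
}
```

On the VM it could be used as: monit summary | not_running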
Your output above suggests that there are no logs for the etcd node itself. Can you double check that there isn't a file like /var/vcap/sys/log/etcd/etcd_ctl* or /var/vcap/sys/log/etcd/etcd*.log?
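One way to run that check is sketched below. The function name list_etcd_logs is hypothetical; the default directory and the two file patterns are the ones mentioned above, and the directory can be overridden with an argument.

```shell
# Hypothetical helper: lists files matching etcd_ctl* or etcd*.log in the
# given directory (default: /var/vcap/sys/log/etcd) and returns non-zero
# when none exist.
list_etcd_logs() {
  log_dir="${1:-/var/vcap/sys/log/etcd}"
  # Collect matches for both patterns; sort -u drops files matched twice.
  matches=$(for f in "$log_dir"/etcd_ctl* "$log_dir"/etcd*.log; do
    [ -e "$f" ] && printf '%s\n' "$f"
  done | sort -u)
  if [ -z "$matches" ]; then
    echo "no etcd logs found in $log_dir" >&2
    return 1
  fi
  printf '%s\n' "$matches"
}
```

Running list_etcd_logs on the etcd VM prints any matching files, which can then be inspected with tail.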
@evanfarrar @christianang Could we get your help debugging this?
@harshalk91 I think I might have a better idea of why etcd looks like it is not starting. I have continued the thread on https://github.com/cloudfoundry-incubator/etcd-release/issues/27 to help me keep track of it.
I'm closing this issue since conversation has moved to https://github.com/cloudfoundry-incubator/etcd-release/issues/27
I am trying to deploy Cloud Foundry, but I am getting this error during the bosh deploy command.
Here is the error:
Here are the etcd_metrics_server logs:
Here is the etcd part of my manifest: