renzhengeek / issues

0 stars 0 forks source link

sle12 sp1: systemctl stop pacemaker: never return #63

Open renzhengeek opened 8 years ago

renzhengeek commented 8 years ago

node2:

2016-02-18T14:19:50.153717+08:00 ocfs2test2 sbd: [3292]: info: Watchdog enabled.
2016-02-18T14:19:51.232268+08:00 ocfs2test2 lrmd[2305]:   notice: finished - rsc:stonith-sbd action:start call_id:20  exit-code:0 exec-time:1153ms queue-time:0ms
2016-02-18T14:19:51.243023+08:00 ocfs2test2 crmd[2308]:   notice: Operation stonith-sbd_start_0: ok (node=ocfs2test2, call=20, rc=0, cib-update=20, confirmed=true)
2016-02-18T14:19:51.249101+08:00 ocfs2test2 crmd[2308]:   notice: Our peer on the DC (ocfs2test1) is dead
2016-02-18T14:19:51.249390+08:00 ocfs2test2 crmd[2308]:   notice: State transition S_NOT_DC -> S_ELECTION [ input=I_ELECTION cause=C_CRMD_STATUS_CALLBACK origin=peer_update_callback ]
2016-02-18T14:19:51.255631+08:00 ocfs2test2 crmd[2308]:   notice: State transition S_ELECTION -> S_PENDING [ input=I_PENDING cause=C_FSA_INTERNAL origin=do_election_count_vote ]
2016-02-18T14:19:51.256280+08:00 ocfs2test2 attrd[2306]:   notice: crm_update_peer_proc: Node ocfs2test1[318951508] - state is now lost (was member)
2016-02-18T14:19:51.256435+08:00 ocfs2test2 attrd[2306]:   notice: Removing all ocfs2test1 attributes for attrd_peer_change_cb
2016-02-18T14:19:51.256561+08:00 ocfs2test2 attrd[2306]:   notice: Lost attribute writer ocfs2test1
2016-02-18T14:19:51.256932+08:00 ocfs2test2 attrd[2306]:   notice: Removing ocfs2test1/318951508 from the membership list
2016-02-18T14:19:51.257065+08:00 ocfs2test2 attrd[2306]:   notice: Purged 1 peers with id=318951508 and/or uname=ocfs2test1 from the membership cache
2016-02-18T14:19:51.258926+08:00 ocfs2test2 stonith-ng[2304]:   notice: crm_update_peer_proc: Node ocfs2test1[318951508] - state is now lost (was member)
2016-02-18T14:19:51.259083+08:00 ocfs2test2 stonith-ng[2304]:   notice: Removing ocfs2test1/318951508 from the membership list
2016-02-18T14:19:51.259507+08:00 ocfs2test2 stonith-ng[2304]:   notice: Purged 1 peers with id=318951508 and/or uname=ocfs2test1 from the membership cache
2016-02-18T14:19:51.260441+08:00 ocfs2test2 cib[2303]:   notice: crm_update_peer_proc: Node ocfs2test1[318951508] - state is now lost (was member)
2016-02-18T14:19:51.260595+08:00 ocfs2test2 cib[2303]:   notice: Removing ocfs2test1/318951508 from the membership list
2016-02-18T14:19:51.260864+08:00 ocfs2test2 cib[2303]:   notice: Purged 1 peers with id=318951508 and/or uname=ocfs2test1 from the membership cache
2016-02-18T14:19:51.287319+08:00 ocfs2test2 corosync[1934]:   [TOTEM ] A new membership (147.2.208.82:160) was formed. Members left: 318951508
2016-02-18T14:19:51.289537+08:00 ocfs2test2 corosync[1934]:   [QUORUM] This node is within the non-primary component and will NOT provide any services.
2016-02-18T14:19:51.289722+08:00 ocfs2test2 corosync[1934]:   [QUORUM] Members[2]: 318951506 318951507
2016-02-18T14:19:51.289883+08:00 ocfs2test2 corosync[1934]:   [MAIN  ] Completed service synchronization, ready to provide service.
2016-02-18T14:19:51.290023+08:00 ocfs2test2 pacemakerd[2301]:   notice: Membership 160: quorum lost (2)
2016-02-18T14:19:51.290202+08:00 ocfs2test2 pacemakerd[2301]:   notice: crm_reap_unseen_nodes: Node ocfs2test1[318951508] - state is now lost (was member)
2016-02-18T14:19:51.290383+08:00 ocfs2test2 crmd[2308]:   notice: Membership 160: quorum lost (2)
2016-02-18T14:19:51.290522+08:00 ocfs2test2 crmd[2308]:   notice: crm_reap_unseen_nodes: Node ocfs2test1[318951508] - state is now lost (was member)

2016-02-18T14:19:51.291318+08:00 ocfs2test2 kernel: [ 1800.035035] dlm: closing connection to node 318951508
2016-02-18T14:19:51.299855+08:00 ocfs2test2 sbd: [1918]: WARN: CIB: We do NOT have quorum!
2016-02-18T14:19:51.300105+08:00 ocfs2test2 sbd: [1916]: WARN: Pacemaker health check: UNHEALTHY
2016-02-18T14:19:51.352287+08:00 ocfs2test2 crmd[2308]:   notice: State transition S_PENDING -> S_NOT_DC [ input=I_NOT_DC cause=C_HA_MESSAGE origin=do_cl_join_finalize_respond ]
2016-02-18T14:19:52.386533+08:00 ocfs2test2 lrmd[2305]:   notice: executing - rsc:stonith-sbd action:stop call_id:21
2016-02-18T14:19:52.387316+08:00 ocfs2test2 lrmd[2305]:   notice: finished - rsc:stonith-sbd action:stop call_id:21  exit-code:0 exec-time:1ms queue-time:0ms
2016-02-18T14:19:52.387610+08:00 ocfs2test2 crmd[2308]:   notice: Operation stonith-sbd_stop_0: ok (node=ocfs2test2, call=21, rc=0, cib-update=25, confirmed=true)
2016-02-18T14:19:52.388212+08:00 ocfs2test2 lrmd[2305]:   notice: executing - rsc:ocfs2-1 action:stop call_id:23
2016-02-18T14:19:52.418654+08:00 ocfs2test2 Filesystem(ocfs2-1)[3319]: INFO: Running stop for /dev/sdb on /mnt/shared
2016-02-18T14:19:52.428163+08:00 ocfs2test2 Filesystem(ocfs2-1)[3319]: INFO: Trying to unmount /mnt/shared
2016-02-18T14:20:00.383632+08:00 ocfs2test2 pacemakerd[2301]:   notice: Invoking handler for signal 15: Terminated
2016-02-18T14:20:00.383844+08:00 ocfs2test2 pacemakerd[2301]:   notice: Shutting down Pacemaker
2016-02-18T14:20:00.384107+08:00 ocfs2test2 pacemakerd[2301]:   notice: Stopping crmd: Sent -15 to process 2308
2016-02-18T14:20:00.384269+08:00 ocfs2test2 crmd[2308]:   notice: Invoking handler for signal 15: Terminated
2016-02-18T14:20:00.384416+08:00 ocfs2test2 crmd[2308]:   notice: Requesting shutdown, upper limit is 1200000ms
2016-02-18T14:20:47.535777+08:00 ocfs2test2 dlm_controld[2404]: 1856 fence work wait for quorum
2016-02-18T14:20:50.539038+08:00 ocfs2test2 dlm_controld[2404]: 1859 DFF9D86724D949809A96A327761DD25C wait for quorum
renzhengeek commented 8 years ago

after a log time, node2 reboot

renzhengeek commented 8 years ago

2016-02-18T14:19:50.178966+08:00 ocfs2test1 dlm_controld[2569]: 2397 cpg_dispatch error 9

renzhengeek commented 8 years ago

2016-02-18T14:30:09.191973+08:00 linux-xtl7 systemd[1]: [/usr/lib/systemd/system/fstrim.timer:8] Unknown lvalue 'Persistent' in section 'Timer'