renzhengeek / issues

0 stars 0 forks source link

yan_gao #45

Closed renzhengeek closed 8 years ago

renzhengeek commented 8 years ago
crm(live)# status
Last updated: Tue Dec 22 18:36:08 2015
Last change: Tue Dec 22 18:35:30 2015 by root via cibadmin on sle121
Stack: corosync
Current DC: sle121 (1084783126) - partition with quorum
Version: 1.1.12-ad083a8
2 Nodes configured
5 Resources configured

Online: [ sle121 sle122 ]

 stonith_sbd    (stonith:external/sbd): Started sle121
 Clone Set: base-clone [base-group]
     Started: [ sle122 ]
     Stopped: [ sle121 ]

Failed actions:
    clvm_start_0 on sle121 'unknown error' (1): call=15, status=complete, last-rc-change='Tue Dec 22 18:35:04 2015', queued=0ms, exec=119ms
renzhengeek commented 8 years ago
sle121:~ # crm
crm(live)# configure
crm(live)configure# primitive ocfs2-1 ocf:heartbeat:Filesystem \
   >       params device="/dev/sdb1" directory="/mnt/shared" fstype="ocfs2" options="acl" \
   >       op monitor interval="20" timeout="40"
crm(live)configure# edit ocfs2-1
WARNING: ocfs2-1: default timeout 20s for start is smaller than the advised 60
WARNING: ocfs2-1: default timeout 20s for stop is smaller than the advised 60
crm(live)configure# commit
WARNING: ocfs2-1: default timeout 20s for start is smaller than the advised 60
WARNING: ocfs2-1: default timeout 20s for stop is smaller than the advised 60
crm(live)configure# edit base-group
crm(live)configure# commit
crm(live)configure# cd ..
crm(live)# status
Last updated: Tue Dec 22 18:46:41 2015
Last change: Tue Dec 22 18:46:39 2015 by root via cibadmin on sle121
Stack: corosync
Current DC: sle122 (1084783173) - partition with quorum
Version: 1.1.12-ad083a8
2 Nodes configured
9 Resources configured

Online: [ sle121 sle122 ]

 stonith_sbd    (stonith:external/sbd): Started sle122
 Clone Set: base-clone [base-group]
     Started: [ sle121 sle122 ]
renzhengeek commented 8 years ago
sle11sp41:~ # crm status
Last updated: Wed Dec 23 10:39:44 2015
Last change: Wed Dec 23 10:39:12 2015 by root via cibadmin on sle11sp41
Stack: classic openais (with plugin)
Current DC: sle11sp42 - partition with quorum
Version: 1.1.12-f47ea56
2 Nodes configured, 2 expected votes
7 Resources configured

Online: [ sle11sp41 sle11sp42 ]

 stonith-libvirt        (stonith:external/libvirt):     Started sle11sp42

Failed actions:
    o2cb_start_0 on sle11sp41 'unknown error' (1): call=22, status=complete, exit-reason='none', last-rc-change='Wed Dec 23 10:39:29 2015', queued=0ms, exec=11102ms
    o2cb_start_0 on sle11sp42 'unknown error' (1): call=22, status=complete, exit-reason='none', last-rc-change='Wed Dec 23 10:28:09 2015', queued=0ms, exec=11078ms
sle11sp41:~ # vim /var/log/messages 
Dec 23 10:39:40 sle11sp41 o2cb(o2cb)[5442]: ERROR: ocfs2_controld.pcmk did not come up
Dec 23 10:39:40 sle11sp41 crmd[5366]:   notice: process_lrm_event: Operation o2cb_start_0: unknown error (node=sle11sp41, call=22, rc=1, cib-update=13, confirmed=true)
Dec 23 10:39:40 sle11sp41 crmd[5366]:   notice: process_lrm_event: sle11sp41-o2cb_start_0:22 [  5361 ?        S<     0:00  \_ /usr/lib64/pacemaker/cib\n 5362 ?        S<     0:00  \_ /usr/lib64/pacemaker/stonithd\n 5363 ?        S<     0:00  \_ /usr/lib64/pacemaker/lrmd\n 5442 ?        S      0:00  |   \_ /bin/bash /usr/lib/ocf/resource.d/ocfs2/o2cb start\n 5525 ?        R      0:00  |       \_ ps axf\n 5526 ?        S      0:00  |       \_ grep -C 3 5442\n 5364 ?        S<     0:00  \_ /usr/lib64/pacemaker/attrd\n 5365 ?        S<     0:00  \
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_cs_dispatch: Update relayed from sle11sp42
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_trigger_update: Sending flush op to all hosts for: fail-count-o2cb (INFINITY)
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_perform_update: Sent update 10: fail-count-o2cb=INFINITY
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_cs_dispatch: Update relayed from sle11sp42
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_trigger_update: Sending flush op to all hosts for: last-failure-o2cb (1450838380)
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_perform_update: Sent update 12: last-failure-o2cb=1450838380
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_cs_dispatch: Update relayed from sle11sp42
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_trigger_update: Sending flush op to all hosts for: fail-count-o2cb (INFINITY)
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_perform_update: Sent update 14: fail-count-o2cb=INFINITY
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_cs_dispatch: Update relayed from sle11sp42
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_trigger_update: Sending flush op to all hosts for: last-failure-o2cb (1450838380)
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_perform_update: Sent update 16: last-failure-o2cb=1450838380
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_cs_dispatch: Update relayed from sle11sp42
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_trigger_update: Sending flush op to all hosts for: fail-count-o2cb (INFINITY)
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_perform_update: Sent update 18: fail-count-o2cb=INFINITY
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_cs_dispatch: Update relayed from sle11sp42
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_trigger_update: Sending flush op to all hosts for: last-failure-o2cb (1450838380)
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_perform_update: Sent update 20: last-failure-o2cb=1450838380
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_cs_dispatch: Update relayed from sle11sp42
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_trigger_update: Sending flush op to all hosts for: fail-count-o2cb (INFINITY)
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_perform_update: Sent update 22: fail-count-o2cb=INFINITY
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_cs_dispatch: Update relayed from sle11sp42
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_trigger_update: Sending flush op to all hosts for: last-failure-o2cb (1450838380)
Dec 23 10:39:40 sle11sp41 attrd[5364]:   notice: attrd_perform_update: Sent update 24: last-failure-o2cb=1450838380
Dec 23 10:39:40 sle11sp41 crmd[5366]:   notice: process_lrm_event: Operation o2cb_stop_0: ok (node=sle11sp41, call=23, rc=0, cib-update=14, confirmed=true)
renzhengeek commented 8 years ago
crm(live)# status
Last updated: Wed Dec 23 10:55:41 2015
Last change: Wed Dec 23 10:52:34 2015 by root via cibadmin on sle11sp41
Stack: classic openais (with plugin)
Current DC: sle11sp41 - partition with quorum
Version: 1.1.12-f47ea56
2 Nodes configured, 2 expected votes
7 Resources configured

Online: [ sle11sp41 sle11sp42 ]

 stonith-libvirt        (stonith:external/libvirt):     Started sle11sp41

Failed actions:
    dlm_start_0 on sle11sp41 'not configured' (6): call=24, status=complete, exit-reason='none', last-rc-change='Wed Dec 23 10:53:25 2015', queued=0ms, exec=37ms

Dec 23 10:53:25 sle11sp41 controld(dlm)[6110]: ERROR: The cluster property stonith-enabled may not be deactivated to use the DLM
Dec 23 10:53:25 sle11sp41 crmd[5998]:   notice: process_lrm_event: Operation dlm_start_0: not configured (node=sle11sp41, call=24, rc=6, cib-update=45, confirmed=true)
Dec 23 10:53:25 sle11sp41 crmd[5998]:  warning: status_from_rc: Action 7 (dlm_start_0) on sle11sp41 failed (target: 0 vs. rc: 6): Error
Dec 23 10:53:25 sle11sp41 crmd[5998]:   notice: abort_transition_graph: Transition aborted by dlm_start_0 'modify' on sle11sp41: Event failed (magic=0:6;7:1:0:208d205e-2268-4a9c-b31e-936386c2ba2c, cib=0.19.19, source=match_graph_event:347, 0)
Dec 23 10:53:25 sle11sp41 crmd[5998]:  warning: update_failcount: Updating failcount for dlm on sle11sp41 after failed start: rc=6 (update=INFINITY, time=1450839205)
Dec 23 10:53:25 sle11sp41 crmd[5998]:  warning: status_from_rc: Action 7 (dlm_start_0) on sle11sp41 failed (target: 0 vs. rc: 6): Error
Dec 23 10:53:25 sle11sp41 crmd[5998]:  warning: update_failcount: Updating failcount for dlm on sle11sp41 after failed start: rc=6 (update=INFINITY, time=1450839205)
Dec 23 10:53:25 sle11sp41 crmd[5998]:   notice: run_graph: Transition 1 (Complete=4, Pending=0, Fired=0, Skipped=5, Incomplete=1, Source=/var/lib/pacemaker/pengine/pe-input-32.bz2): Stopped
Dec 23 10:53:25 sle11sp41 attrd[5996]:   notice: attrd_trigger_update: Sending flush op to all hosts for: fail-count-dlm (INFINITY)
Dec 23 10:53:25 sle11sp41 attrd[5996]:   notice: attrd_perform_update: Sent update 9: fail-count-dlm=INFINITY
Dec 23 10:53:25 sle11sp41 attrd[5996]:   notice: attrd_trigger_update: Sending flush op to all hosts for: last-failure-dlm (1450839205)
Dec 23 10:53:25 sle11sp41 pengine[5997]:   notice: unpack_config: On loss of CCM Quorum: Ignore
Dec 23 10:53:25 sle11sp41 attrd[5996]:   notice: attrd_perform_update: Sent update 11: last-failure-dlm=1450839205
Dec 23 10:53:25 sle11sp41 attrd[5996]:   notice: attrd_trigger_update: Sending flush op to all hosts for: fail-count-dlm (INFINITY)
Dec 23 10:53:25 sle11sp41 pengine[5997]:  warning: unpack_rsc_op_failure: Processing failed op start for dlm:0 on sle11sp41: not configured (6)
Dec 23 10:53:25 sle11sp41 pengine[5997]:    error: unpack_rsc_op: Preventing base-clone from re-starting anywhere: operation start failed 'not configured' (6)
Dec 23 10:53:25 sle11sp41 pengine[5997]:  warning: unpack_rsc_op_failure: Processing failed op start for dlm:0 on sle11sp41: not configured (6)
Dec 23 10:53:25 sle11sp41 pengine[5997]:    error: unpack_rsc_op: Preventing base-clone from re-starting anywhere: operation start failed 'not configured' (6)
Dec 23 10:53:25 sle11sp41 pengine[5997]:   notice: LogActions: Stop    dlm:0    (sle11sp41)
Dec 23 10:53:25 sle11sp41 pengine[5997]:   notice: process_pe_message: Calculated Transition 2: /var/lib/pacemaker/pengine/pe-input-33.bz2
Dec 23 10:53:25 sle11sp41 attrd[5996]:   notice: attrd_perform_update: Sent update 13: fail-count-dlm=INFINITY
Dec 23 10:53:25 sle11sp41 attrd[5996]:   notice: attrd_trigger_update: Sending flush op to all hosts for: last-failure-dlm (1450839205)
Dec 23 10:53:25 sle11sp41 attrd[5996]:   notice: attrd_perform_update: Sent update 15: last-failure-dlm=1450839205
Dec 23 10:53:25 sle11sp41 pengine[5997]:   notice: unpack_config: On loss of CCM Quorum: Ignore
Dec 23 10:53:25 sle11sp41 pengine[5997]:  warning: unpack_rsc_op_failure: Processing failed op start for dlm:0 on sle11sp41: not configured (6)
Dec 23 10:53:25 sle11sp41 pengine[5997]:    error: unpack_rsc_op: Preventing base-clone from re-starting anywhere: operation start failed 'not configured' (6)
Dec 23 10:53:25 sle11sp41 pengine[5997]:  warning: unpack_rsc_op_failure: Processing failed op start for dlm:0 on sle11sp41: not configured (6)
Dec 23 10:53:25 sle11sp41 pengine[5997]:    error: unpack_rsc_op: Preventing base-clone from re-starting anywhere: operation start failed 'not configured' (6)
Dec 23 10:53:25 sle11sp41 pengine[5997]:  warning: common_apply_stickiness: Forcing base-clone away from sle11sp41 after 1000000 failures (max=3)
Dec 23 10:53:25 sle11sp41 pengine[5997]:  warning: common_apply_stickiness: Forcing base-clone away from sle11sp41 after 1000000 failures (max=3)
Dec 23 10:53:25 sle11sp41 pengine[5997]:   notice: LogActions: Stop    dlm:0    (sle11sp41)