Closed hungta closed 8 years ago
Did you enable heartbeat in clTransport.xml? Please take a look at the example: https://github.com/OpenClovis/SAFplus-Availability-Scalability-Platform/tree/master/src/examples/cloud_example
Yes, the heartbeat was defined in clTransport.xml: ` <?xml version="1.0" encoding="UTF-8" standalone="no"?>
`
The clTransport.xml is not correct:
Please take a look at the cloud_example model.
I removed the
On SCNodeI1:
[aspinfo@SCNodeI1]==> nodes
NODE      CLASS  AS  CAS  PS  OS  INSTANTIABLE  CLUSTER-MEMBER  ISU  ASU
SCNodeI0  B      UL  UL   I   E   Y             Y               1    1
SCNodeI1  B      UL  UL   I   E   Y             Y               1    1
[aspinfo@SCNodeI1]==> cluster
NODE-NAME  NODE-TYPE   HA-STATE  NODE-ADDR
SCNodeI0   controller  active    1
SCNodeI1   controller  standby   2  <-- this node
Then I killed the amf process on SCNodeI0. A few moments later SCNodeI0 came back up, but the two nodes no longer seemed to talk to each other:
On SCNodeI0:
[aspinfo@SCNodeI0]==> nodes
NODE      CLASS  AS  CAS  PS  OS  INSTANTIABLE  CLUSTER-MEMBER  ISU  ASU
SCNodeI0  B      UL  UL   I   E   Y             Y               1    1
SCNodeI1  B      UL  UL   U   D   N             N               0    0
[aspinfo@SCNodeI0]==> cluster
NODE-NAME  NODE-TYPE   HA-STATE  NODE-ADDR
SCNodeI0   controller  active    1  <-- this node
On SCNodeI1:
[aspinfo@SCNodeI1]==> nodes
NODE      CLASS  AS  CAS  PS  OS  INSTANTIABLE  CLUSTER-MEMBER  ISU  ASU
SCNodeI0  B      UL  UL   U   D   N             N               0    0
SCNodeI1  B      UL  UL   I   E   Y             Y               1    1
[aspinfo@SCNodeI1]==> cluster
NODE-NAME  NODE-TYPE   HA-STATE  NODE-ADDR
SCNodeI1   controller  active    2  <-- this node
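The two `cluster` listings above each show a different controller as active, which is the symptom worth spotting quickly. A minimal sketch of how that check could be automated, assuming the `cluster` output has been captured as text per node; the function names and the term "split brain" are illustrative and are not part of SAFplus or aspinfo:

```python
def active_nodes(cluster_output):
    """Return node names whose HA-STATE column reads 'active'."""
    actives = []
    for line in cluster_output.splitlines():
        fields = line.split()
        # Data rows look like: NODE-NAME NODE-TYPE HA-STATE NODE-ADDR [<-- this node]
        if len(fields) >= 4 and fields[2] == "active":
            actives.append(fields[0])
    return actives


def is_split_brain(views):
    """True if the nodes' views together name more than one active controller."""
    seen = set()
    for output in views.values():
        seen.update(active_nodes(output))
    return len(seen) > 1


HEADER = "NODE-NAME NODE-TYPE HA-STATE NODE-ADDR\n"

# Views captured after the amf process was killed (copied from the listings above)
after_kill = {
    "SCNodeI0": HEADER + "SCNodeI0 controller active 1 <-- this node",
    "SCNodeI1": HEADER + "SCNodeI1 controller active 2 <-- this node",
}

# View captured before the kill, as seen from SCNodeI1
before_kill = {
    "SCNodeI1": HEADER
    + "SCNodeI0 controller active 1\n"
    + "SCNodeI1 controller standby 2 <-- this node",
}

print(is_split_brain(after_kill))   # True
print(is_split_brain(before_kill))  # False
```

Each node claiming to be the only active controller usually means the nodes cannot see each other's traffic, so the transport configuration is the first thing to inspect.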
Please check the configuration:
`
<peer addr="192.168.6.144"/>
</peerAddresses>
` 192.168.56.143 and 192.168.6.144: both must be able to communicate with each other.
They can communicate with each other, of course.
Please attach asp.conf and the ifconfig output from both nodes. I suspect the issue is related to the configuration, since I cannot reproduce it here.
Also, please try reducing the heartbeat interval value to 1000 (1 second).
SCNodeI0:
root@ubuntu:~/udptest# ifconfig
eth0 Link encap:Ethernet HWaddr 08:00:27:ce:ff:a4
inet addr:10.20.18.108 Bcast:10.20.18.255 Mask:255.255.255.0
inet6 addr: fe80::a00:27ff:fece:ffa4/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:125519 errors:0 dropped:0 overruns:0 frame:0
TX packets:4487 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:18259196 (18.2 MB) TX bytes:495011 (495.0 KB)
eth1 Link encap:Ethernet HWaddr 08:00:27:f3:05:e0
inet addr:192.168.56.143 Bcast:192.168.56.255 Mask:255.255.255.0
inet6 addr: fe80::a00:27ff:fef3:5e0/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:183645 errors:0 dropped:0 overruns:0 frame:0
TX packets:125656 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:121421145 (121.4 MB) TX bytes:22219556 (22.2 MB)
eth1:11 Link encap:Ethernet HWaddr 08:00:27:f3:05:e0
inet addr:169.254.100.1 Bcast:169.254.100.255 Mask:255.255.255.0
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:65536 Metric:1
RX packets:94656 errors:0 dropped:0 overruns:0 frame:0
TX packets:94656 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:15790963 (15.7 MB) TX bytes:15790963 (15.7 MB)
asp.conf:
export NODENAME=SCNodeI0
export DEFAULT_NODEADDR=1
#
export AUTO_ASSIGN_NODEADDR=
export SAHPI_UNSPECIFIED_DOMAIN_ID=UNDEFINED
export OPENHPI_CONF="${ASP_DIR}/etc/openhpi.conf"
export MIBDIRS="${ASP_DIR}/share/snmp/mibs"
export SNMP_TRAP_ADDR=127.0.0.1
export LINK_NAME=eth1
export TIPC_NETID=1340
#
#
#
#
export ASP_SIMULATION=0
export SYSTEM_CONTROLLER=1
export ASP_VALGRIND_CMD=""
export CL_LOG_STREAM_ENABLE=DEBUG
export ASP_UDP_USE_EXISTING_IP=true
export ASP_UDP_LINK_NAME=eth1
SCNodeI1:
root@ubuntu:~/udptest# ifconfig
eth0 Link encap:Ethernet HWaddr 08:00:27:ad:7a:aa
inet addr:10.20.18.96 Bcast:10.20.18.255 Mask:255.255.255.0
inet6 addr: fe80::a00:27ff:fead:7aaa/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:126195 errors:0 dropped:0 overruns:0 frame:0
TX packets:3689 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:18323178 (18.3 MB) TX bytes:420944 (420.9 KB)
eth1 Link encap:Ethernet HWaddr 08:00:27:c6:d6:cf
inet addr:192.168.56.144 Bcast:192.168.56.255 Mask:255.255.255.0
inet6 addr: fe80::a00:27ff:fec6:d6cf/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:184072 errors:0 dropped:95 overruns:0 frame:0
TX packets:102100 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:119676854 (119.6 MB) TX bytes:20467196 (20.4 MB)
eth1:12 Link encap:Ethernet HWaddr 08:00:27:c6:d6:cf
inet addr:169.254.100.2 Bcast:169.254.100.255 Mask:255.255.255.0
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:65536 Metric:1
RX packets:83087 errors:0 dropped:0 overruns:0 frame:0
TX packets:83087 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:17083232 (17.0 MB) TX bytes:17083232 (17.0 MB)
asp.conf:
export NODENAME=SCNodeI1
export DEFAULT_NODEADDR=2
#
export AUTO_ASSIGN_NODEADDR=
export SAHPI_UNSPECIFIED_DOMAIN_ID=UNDEFINED
export OPENHPI_CONF="${ASP_DIR}/etc/openhpi.conf"
export MIBDIRS="${ASP_DIR}/share/snmp/mibs"
export SNMP_TRAP_ADDR=127.0.0.1
export LINK_NAME=eth1
export TIPC_NETID=1340
#
#
#
#
export ASP_SIMULATION=0
export SYSTEM_CONTROLLER=1
export ASP_VALGRIND_CMD=""
export CL_LOG_STREAM_ENABLE=DEBUG
export ASP_UDP_USE_EXISTING_IP=true
export ASP_UDP_LINK_NAME=eth1
Yes, that's why I asked you to check the configuration:
<peerAddresses port="6799"> <peer addr="192.168.56.143"/> <peer addr="192.168.6.144"/> </peerAddresses>
It should be changed to:
<peerAddresses port="6799"> <peer addr="192.168.56.143"/> <peer addr="192.168.56.144"/> </peerAddresses>
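A typo like this can be caught before starting the nodes. The sketch below flags any peer address that falls outside the subnet of the interface SAFplus is bound to. It assumes all peers sit on the same subnet as eth1, which holds for this setup but not necessarily for routed deployments; `peers_outside_subnet` is a hypothetical helper, not part of SAFplus:

```python
import ipaddress
import xml.etree.ElementTree as ET


def peers_outside_subnet(xml_fragment, local_cidr):
    """Return peer addresses that are not inside the local interface's subnet."""
    network = ipaddress.ip_interface(local_cidr).network
    root = ET.fromstring(xml_fragment)
    return [
        peer.get("addr")
        for peer in root.iter("peer")
        if ipaddress.ip_address(peer.get("addr")) not in network
    ]


BROKEN = (
    '<peerAddresses port="6799">'
    '<peer addr="192.168.56.143"/><peer addr="192.168.6.144"/>'
    "</peerAddresses>"
)
FIXED = (
    '<peerAddresses port="6799">'
    '<peer addr="192.168.56.143"/><peer addr="192.168.56.144"/>'
    "</peerAddresses>"
)

# eth1 on SCNodeI0 is 192.168.56.143 with netmask 255.255.255.0 (see ifconfig above)
print(peers_outside_subnet(BROKEN, "192.168.56.143/24"))  # ['192.168.6.144']
print(peers_outside_subnet(FIXED, "192.168.56.143/24"))   # []
```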
Yes, it was a fault in the UDP configuration. Additionally, reducing the heartbeat interval makes the failover faster. Thanks.
Configure the model using UDP transport. Start 2 SC nodes; both come up. Then kill the safplus_amf pid on the active node. SAFplus never comes up after that.