xcat2 / xcat-extensions

Repos to store scripts for special user cases

Run xcatha.py will break ssh session #11

Closed neo954 closed 5 years ago

neo954 commented 6 years ago

This bug is against xcatha.py commit 3c0251fe0763f597f098bd16a4d4542c3fe8b9c8.

Host c910f03c01p07 has the following network configuration.

# ip addr list
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
       valid_lft forever preferred_lft forever
    inet6 ::1/128 scope host
       valid_lft forever preferred_lft forever
2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UNKNOWN group default qlen 1000
    link/ether ba:60:85:5b:18:03 brd ff:ff:ff:ff:ff:ff
    inet 10.3.1.7/8 brd 10.255.255.255 scope global eth0
       valid_lft forever preferred_lft forever
    inet6 fe80::b860:85ff:fe5b:1803/64 scope link
       valid_lft forever preferred_lft forever
3: eth1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UNKNOWN group default qlen 1000
    link/ether ba:60:85:5b:18:04 brd ff:ff:ff:ff:ff:ff
    inet6 fe80::b860:85ff:fe5b:1804/64 scope link
       valid_lft forever preferred_lft forever

When xcatha.py is run with the following command-line arguments, it tries to replace the IP address of the backbone network interface with 10.3.1.99. This network reconfiguration breaks the ssh session used to run xcatha.py.

# ./xcatha.py -s -p /media/u/gongjie/ha-test -i eth0 -v 10.3.1.99 -n c910f03c01p99 -m 255.0.0.0 -t sqlite
============================================================================================
2018-06-06 01:12:13,740 - INFO - Check virtual ip stage
2018-06-06 01:12:13,740 - INFO - ping -c 1 -w 10 10.3.1.99
PING 10.3.1.99 (10.3.1.99) 56(84) bytes of data.
From 10.3.1.7 icmp_seq=1 Destination Host Unreachable
From 10.3.1.7 icmp_seq=2 Destination Host Unreachable
From 10.3.1.7 icmp_seq=3 Destination Host Unreachable

--- 10.3.1.99 ping statistics ---
3 packets transmitted, 0 received, +3 errors, 100% packet loss, time 2005ms
pipe 3
2018-06-06 01:12:16,738 - INFO - virtual ip can be used.
============================================================================================
2018-06-06 01:12:16,738 - INFO - Check if xCAT data is in shared data directory
2018-06-06 01:12:16,739 - INFO - There is xCAT data /media/u/gongjie/ha-test/install in shared data /media/u/gongjie/ha-test
============================================================================================
2018-06-06 01:12:16,739 - INFO - Check if target dbtype is the same with shared data dbtype stage
2018-06-06 01:12:16,740 - INFO - Database type is 'sqlite' in shared data directory
2018-06-06 01:12:16,740 - INFO - Target database type is matched [Passed]
============================================================================================
2018-06-06 01:12:16,740 - INFO - Configure virtual ip as alias ip stage

The expected behavior is to keep the IP address of the backbone network interface and apply the "virtual IP" as an alias on the same network adapter.
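To illustrate the expected behavior, here is a minimal sketch of how the virtual IP could be added as a labeled secondary address instead of replacing the primary one. It is not taken from xcatha.py: the function name `build_alias_command` and the `alias_index` parameter are hypothetical, and the sketch only builds the `ip addr add ... label ...` command without running it.

```python
import ipaddress


def build_alias_command(nic, vip, netmask, alias_index=0):
    """Build an `ip` command that adds vip as an extra, labeled address on nic.

    Hypothetical helper for illustration only; it does not run the command,
    so the existing address on nic (and any ssh session over it) is untouched.
    """
    # Validate the virtual IP; raises ValueError on malformed input.
    address = ipaddress.IPv4Address(vip)
    # Convert a dotted netmask such as 255.0.0.0 into a prefix length.
    prefix_len = ipaddress.IPv4Network("0.0.0.0/" + netmask).prefixlen
    # Label the new address as an alias of the interface, e.g. eth0:0.
    label = "{0}:{1}".format(nic, alias_index)
    return [
        "ip", "addr", "add", "{0}/{1}".format(address, prefix_len),
        "dev", nic, "label", label,
    ]


if __name__ == "__main__":
    # Values taken from the command line in this report.
    print(" ".join(build_alias_command("eth0", "10.3.1.99", "255.0.0.0")))
```

Because `ip addr add` appends an address rather than replacing the existing one, 10.3.1.7 would stay configured on eth0 in this sketch.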

neo954 commented 6 years ago

I consider this bug a BLOCK issue.

robin2008 commented 6 years ago

@bybai It seems reasonable, the tool should not break the ssh session.

bybai commented 6 years ago

This is an accepted bug, but not a BLOCK issue. We advise using an alias NIC such as "-i eth0:0" instead of "-i eth0".

robin2008 commented 6 years ago

If that is the case, just print an error asking the user to input eth0:0 and quit without continuing. We cannot go ahead if the wrong input will cause a big issue.
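A minimal sketch of the kind of guard suggested here, assuming the value passed to `-i` is available as a plain string. The names `is_alias_nic` and `check_nic_or_exit` are hypothetical and are not part of xcatha.py; the pattern simply requires the `name:label` form such as `eth0:0`.

```python
import re
import sys

# Matches alias-style names such as "eth0:0": a base interface name,
# a colon, then a non-empty alias label.
ALIAS_NIC_PATTERN = re.compile(r"^[A-Za-z0-9_.-]+:[A-Za-z0-9_]+$")


def is_alias_nic(name):
    """Return True if name looks like an alias interface (e.g. eth0:0)."""
    return ALIAS_NIC_PATTERN.match(name) is not None


def check_nic_or_exit(name):
    """Return name unchanged if it is an alias; otherwise exit with an error."""
    if not is_alias_nic(name):
        # sys.exit with a string prints it to stderr and exits with status 1.
        sys.exit(
            'Error: "-i {0}" names a primary interface; '
            'use an alias such as "{0}:0" instead.'.format(name)
        )
    return name


if __name__ == "__main__":
    print(check_nic_or_exit("eth0:0"))
```

Calling `check_nic_or_exit("eth0")` would stop the script before any network reconfiguration is attempted.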

bybai commented 6 years ago

Hi @gurevichmark, could you look at this issue?

neo954 commented 6 years ago

Refer to xcat2/xcat2-task-management#163

zet809 commented 6 years ago

Is this issue fixed? If not, please move it to 2.14.3; if yes, please close this defect.

gurevichmark commented 6 years ago

Since we now ask the admin to set up the VIP manually, I do not think this issue is valid anymore.

zet809 commented 6 years ago

Hi @neo954, as @gurevichmark has mentioned above, what do you think?

gurevichmark commented 5 years ago

No updates since July. Closing this issue.