datacenter / ACI-Pre-Upgrade-Validation-Script

A script to run validations to detect potential issues that may cause an ACI fabric upgrade to fail
https://datacenter.github.io/ACI-Pre-Upgrade-Validation-Script/
Apache License 2.0

Need to add manual check for flow control behavior change after an upgrade #31

Closed prkrishn closed 1 year ago

prkrishn commented 2 years ago

When ACI leaf switches are upgraded from an older 13.x release to a newer 15.2 release, VPCs connected to devices that have link-level flow control in the auto/desirable state can go down with the error "vpc port channel mis-config due to vpc links in the 2 switches connected to different partners". Fault F0518 is also raised.

The root cause is that ACI software in the older release incorrectly signals the far-end device, which has flow control in the auto/desirable state, to enable send/transmit flow control. After the upgrade this behavior is corrected, which leads to the problem.

CSCvo27498, an issue documented for Catalyst switches connected to ACI leaf switches, has more information.

monrog2 commented 2 years ago

A check was written for this; it currently resides only in test_logic.py until the affected versions are verified.

monrog2 commented 2 years ago

Crossing 4.2(6d) is the version hop that can hit this on 1G VPCs:


1. Check if cver/tver hop crosses 4.2(6d)
    if yes, continue

2. Check if any nodes in VPC
    if yes, save node IDs and continue

3. Check if any 1G (sfp) connections exist on VPC nodes
    icurl 'http://127.0.0.1:7777/api/class/ethpmFcot.json?query-target-filter=and(eq(ethpmFcot.type,"sfp"),eq(ethpmFcot.state,"inserted"))'
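
The three steps above could be sketched roughly as follows. This is not the code in test_logic.py; the function names are hypothetical, versions are assumed to be in APIC style such as `4.2(6d)`, "crosses" is assumed to mean the current version is older than 4.2(6d) while the target is 4.2(6d) or newer, and the `ethpmFcot` DN is assumed to contain a `/node-<id>/` segment.

```python
import re

# Hypothetical helpers for illustration; the real script may differ.
_VERSION_RE = re.compile(r"^(\d+)\.(\d+)\((\d+)([a-z]*)\)$")


def parse_aci_version(version):
    """Turn an APIC-style version such as '4.2(6d)' into a sortable tuple."""
    match = _VERSION_RE.match(version.strip())
    if match is None:
        raise ValueError("unrecognized version format: %r" % version)
    major, minor, maint, patch = match.groups()
    return (int(major), int(minor), int(maint), patch)


def hop_crosses(cver, tver, boundary="4.2(6d)"):
    """Step 1: True if cver is older than boundary and tver is at or past it.

    Treating the boundary itself as affected on the target side is an
    assumption, not something confirmed in this thread.
    """
    limit = parse_aci_version(boundary)
    return parse_aci_version(cver) < limit <= parse_aci_version(tver)


def vpc_nodes_with_sfp(imdata, vpc_node_ids):
    """Steps 2-3: return the VPC node IDs that have an inserted sfp.

    imdata is the 'imdata' list from the ethpmFcot query, already
    filtered by the APIC to type == "sfp" and state == "inserted".
    vpc_node_ids is the set of node IDs (as strings) saved in step 2.
    """
    hits = set()
    for item in imdata:
        dn = item["ethpmFcot"]["attributes"]["dn"]
        node = re.search(r"/node-(\d+)/", dn)
        if node and node.group(1) in vpc_node_ids:
            hits.add(node.group(1))
    return sorted(hits)


if __name__ == "__main__":
    # Illustrative data only, not captured from a real fabric.
    sample = [
        {"ethpmFcot": {"attributes": {
            "dn": "topology/pod-1/node-101/sys/phys-[eth1/1]/phys/fcot",
            "type": "sfp", "state": "inserted"}}},
    ]
    if hop_crosses("3.2(10e)", "5.2(4d)"):
        print(vpc_nodes_with_sfp(sample, {"101", "102"}))
```

Keeping the version comparison separate from the query handling lets the boundary be adjusted once the affected versions are verified.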