sonic-net / sonic-mgmt

Configuration management examples for SONiC

[test gap][dualtor-as] dualtor linked with ycable should recover from inconsistency when cable firmware is corrupted #12311

Open zjswhhh opened 6 months ago

zjswhhh commented 6 months ago

Description: It was observed in production that a cable can go bad and return false values from the EEPROM while the data plane stays alive.

If mux toggles then happen, both ToRs can end up active and be unable to recover. The issue should be fixed by https://github.com/sonic-net/sonic-linkmgrd/pull/184/files.

But there is a test gap here.
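
A test closing this gap would mainly need to assert that the ToR pair leaves the both-active state on its own. Below is a minimal sketch of such a check, assuming a hypothetical `get_mux_states` callable that returns the (upper ToR, lower ToR) mux states as strings. None of these names come from sonic-mgmt or sonic-linkmgrd.

```python
import time


def is_consistent(upper_state, lower_state):
    """Active-standby dualtor is consistent when exactly one ToR is active."""
    return sorted([upper_state, lower_state]) == ["active", "standby"]


def wait_for_recovery(get_mux_states, timeout=60.0, interval=1.0, sleep=time.sleep):
    """Poll until the ToR pair is consistent again or the timeout expires.

    get_mux_states is a hypothetical callable returning (upper, lower).
    Returns True on recovery, False if the pair is still inconsistent.
    """
    deadline = time.monotonic() + timeout
    while True:
        upper, lower = get_mux_states()
        if is_consistent(upper, lower):
            return True
        if time.monotonic() >= deadline:
            return False
        sleep(interval)
```

A test could call `wait_for_recovery` after injecting the bad cable state and toggling the mux, and fail if it returns `False`.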

StormLiangMS commented 4 months ago

Hi @prgeor, this one seems to be a platform-related test gap, so I assigned it to you for further triage.

vdahiya12 commented 4 months ago

  1. To simulate case 1, we can check whether stopping the ycabled/mux_simulator combination reproduces it. We can add the test case to the DualToR test cases if it is not already there.
  2. For case 2, we can add a fixture to mux-simulator that returns link down for some time, which should achieve the desired scenario. During teardown we can restore the link state.

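
The fixture idea in item 2 could look roughly like the sketch below. It assumes a hypothetical `FakeMuxSimulator` stand-in with a `set_link` method. The real mux simulator interface in sonic-mgmt is not shown in this thread, so every name here is illustrative only.

```python
import contextlib


class FakeMuxSimulator:
    """Hypothetical stand-in for a mux simulator client (not the sonic-mgmt API)."""

    def __init__(self):
        self.link_state = {"upper_tor": "up", "lower_tor": "up"}

    def set_link(self, side, state):
        self.link_state[side] = state


@contextlib.contextmanager
def forced_link_down(simulator, side):
    """Report link down on one side, then restore the saved state on exit."""
    saved = simulator.link_state[side]
    simulator.set_link(side, "down")
    try:
        yield simulator
    finally:
        # Teardown: runs even if the test body raises.
        simulator.set_link(side, saved)
```

In pytest the same setup, `yield`, and restore steps could sit inside a yield fixture, so the link state is restored during teardown regardless of the test result.
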
zjswhhh commented 3 months ago

Hi @StormLiangMS - I think it's okay to lower the priority of this item. The gap covers a very specific scenario, the image fix was checked in a long time ago, and the issue is not impacting production. Can you help put the right label on?

StormLiangMS commented 3 months ago

Hi @yxieca @zjswhhh, Prince told me this would be owned by you, so I changed the owner and set it to P1.