Open yaqiangz opened 2 years ago
For test_crm_nexthop_group, I observed the following log errors:
E Oct 18 14:30:52.643831 bjw-can-720dt-1 ERR syncd#syncd: [none] SAI_API_NEXT_HOP_GROUP:brcm_sai_xgs_nexthop_group_create:73 ecmp nh group create failed with error Table full (0xfffffffa).
E
E Oct 18 14:30:52.643831 bjw-can-720dt-1 ERR syncd#syncd: [none] SAI_API_NEXT_HOP_GROUP:brcm_sai_create_next_hop_group:147 pd nexthop group create failed failed with error -13.
E
E Oct 18 14:30:52.643831 bjw-can-720dt-1 ERR syncd#syncd: :- sendApiResponse: api SAI_COMMON_API_CREATE failed in syncd mode: SAI_STATUS_TABLE_FULL
E
E Oct 18 14:30:52.643991 bjw-can-720dt-1 ERR syncd#syncd: :- processQuadEvent: attr: SAI_NEXT_HOP_GROUP_ATTR_TYPE: SAI_NEXT_HOP_GROUP_TYPE_DYNAMIC_UNORDERED_ECMP
E
E Oct 18 14:30:52.644412 bjw-can-720dt-1 ERR swss#orchagent: :- create: create status: SAI_STATUS_TABLE_FULL
E
E Oct 18 14:30:52.644412 bjw-can-720dt-1 ERR swss#orchagent: :- addNextHopGroup: Failed to create next hop group 2.0.0.1@PortChannel101,2.0.0.130@PortChannel101, rv:-13
E
E Oct 18 14:30:52.644412 bjw-can-720dt-1 ERR swss#orchagent: :- handleSaiCreateStatus: Encountered failure in create operation, exiting orchagent, SAI API: SAI_API_NEXT_HOP_GROUP, status: SAI_STATUS_TABLE_FULL
E
E Oct 18 14:31:54.200837 bjw-can-720dt-1 ERR swss#supervisor-proc-exit-listener: Process 'orchagent' is not running in namespace 'host' (1.0 minutes).
E
E Oct 18 14:32:54.266360 bjw-can-720dt-1 ERR swss#supervisor-proc-exit-listener: Process 'orchagent' is not running in namespace 'host' (2.0 minutes).
For test_crm_fdb_entry, I observed the following logs:
E Oct 18 14:34:54.391196 bjw-can-720dt-1 ERR swss#supervisor-proc-exit-listener: Process 'orchagent' is not running in namespace 'host' (4.0 minutes).
E
E Oct 18 14:35:54.456514 bjw-can-720dt-1 ERR swss#supervisor-proc-exit-listener: Process 'orchagent' is not running in namespace 'host' (5.0 minutes).
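A log excerpt like the ones above can be triaged with a short script. The following is a minimal sketch, not part of sonic-mgmt: the function name `triage` and the two regular expressions are assumptions written to match the message shapes quoted in this thread. It extracts the SAI API and status that made orchagent exit, and the longest reported downtime per process.

```python
import re

# Patterns are assumptions derived from the log lines quoted in this issue.
SAI_FAILURE = re.compile(
    r"handleSaiCreateStatus: .*SAI API: (?P<api>\w+), status: (?P<status>\w+)"
)
NOT_RUNNING = re.compile(
    r"Process '(?P<proc>[^']+)' is not running in namespace "
    r"'(?P<ns>[^']+)' \((?P<minutes>[\d.]+) minutes\)"
)


def triage(lines):
    """Collect SAI create failures and the longest downtime per (process, namespace)."""
    summary = {"sai_failures": [], "down": {}}
    for line in lines:
        match = SAI_FAILURE.search(line)
        if match:
            summary["sai_failures"].append((match.group("api"), match.group("status")))
            continue
        match = NOT_RUNNING.search(line)
        if match:
            key = (match.group("proc"), match.group("ns"))
            minutes = float(match.group("minutes"))
            summary["down"][key] = max(summary["down"].get(key, 0.0), minutes)
    return summary


# Sample lines copied from the excerpts in this thread.
SAMPLE = [
    "Oct 18 14:30:52.644412 bjw-can-720dt-1 ERR swss#orchagent: :- handleSaiCreateStatus: "
    "Encountered failure in create operation, exiting orchagent, "
    "SAI API: SAI_API_NEXT_HOP_GROUP, status: SAI_STATUS_TABLE_FULL",
    "Oct 18 14:31:54.200837 bjw-can-720dt-1 ERR swss#supervisor-proc-exit-listener: "
    "Process 'orchagent' is not running in namespace 'host' (1.0 minutes).",
    "Oct 18 14:35:54.456514 bjw-can-720dt-1 ERR swss#supervisor-proc-exit-listener: "
    "Process 'orchagent' is not running in namespace 'host' (5.0 minutes).",
]

if __name__ == "__main__":
    print(triage(SAMPLE))
```

On these samples, the orchagent downtime reported during test_crm_fdb_entry traces back to the SAI_STATUS_TABLE_FULL exit logged during test_crm_nexthop_group, which suggests the fdb failure is a follow-on effect and not a separate fault.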
The test_crm_route failure is likely a SAI issue, as the available counter comes from a SAI call.
For test_crm_nexthop_group, it seems that the test tries to create more nexthop groups than the BCM SDK can handle, hence the "Table full" error that causes orchagent to crash.
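One way to avoid overfilling the table is to compute the smallest number of entries that crosses the CRM threshold and refuse to go past the reported capacity. The sketch below is illustrative only: `entries_needed` is a hypothetical helper, not existing test code, and it assumes the threshold is a percentage evaluated as used / (used + available).

```python
def entries_needed(used, available, high_threshold_percent):
    """Return the minimum number of new entries that pushes utilization
    strictly above high_threshold_percent.

    Assumption: utilization = used / (used + available) * 100.
    Raises ValueError if the threshold cannot be crossed within the
    available capacity, so the caller never requests more than the
    table can hold.
    """
    total = used + available
    # Smallest used count that is strictly above the threshold.
    target = (high_threshold_percent * total) // 100 + 1
    needed = max(target - used, 0)
    if needed > available:
        raise ValueError(
            "threshold %d%% needs %d entries but only %d are available"
            % (high_threshold_percent, needed, available)
        )
    return needed


if __name__ == "__main__":
    # Hypothetical counters, for illustration only.
    print(entries_needed(used=10, available=90, high_threshold_percent=85))
```

A guard like this only helps if the available counter reflects what the SDK can actually allocate. If the counter itself is too high, the create call can still fail with "Table full".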
Description
Test failures seen in 720dt.
Steps to reproduce the issue:
Describe the results you received:
Test results:
1.1 crm/test_crm.py::test_crm_route failed
1.2 crm/test_crm.py::test_crm_nexthop_group failed
result = {'expect_messages': {'/tmp/syslog.bjw-can-720dt-1.2022-10-11-02:37:27': []}, 'match_files': {'/tmp/syslog.bjw-can-720d...jw-can-720dt-1.2022-10-11-02:37:27': []}, 'total': {'expected_match': 0, 'expected_missing_match': 1, 'match': 0}, ...}
result_str = 'expected_match: 0\nexpected_missing_match: 1\nmatch: 0\n\nExpected Messages that are missing:\n. THRESHOLD_EXCEEDED .'
self = <tests.common.plugins.loganalyzer.loganalyzer.LogAnalyzer instance at 0x7fa005d9bb40>