sonic-net / sonic-swss

SONiC Switch State Service (SwSS)
https://azure.github.io/SONiC
Other
170 stars 512 forks source link

TD4 - swss not coming up with latest code #3259

Open anilkpan opened 4 weeks ago

anilkpan commented 4 weeks ago

When I load the latest community master image to the z9432f platform, I see the following error. swss is not coming up

root@sonic:/var/log# show logging ERR 2024-08-14T20:41:56.959303+00:00 sonic kernel: [ 6.011437] EDAC MC0: Giving out device to module pnd2_edac controller Pondicherry2: DEV pnd2/dnv (INTERRUPT) 2024-08-14T20:41:56.959303+00:00 sonic kernel: [ 6.011437] EDAC MC0: Giving out device to module pnd2_edac controller Pondicherry2: DEV pnd2/dnv (INTERRUPT) 2024 Aug 14 20:43:14.772812 sonic ERR ntpd[3626]: statistics directory /var/log/ntpsec/ does not exist or is unwriteable, error No such file or directory 2024 Aug 14 20:44:03.762380 sonic ERR monit[8562]: Unix socket /var/run/monit.sock connection error -- No such file or directory 2024 Aug 14 20:44:22.932766 sonic ERR systemd[1]: Failed to start swss.service - switch state service. 2024 Aug 14 20:44:22.935793 sonic ERR featured: ['sudo', 'systemctl', 'start', 'swss.service'] - failed: return code - 1, output:#012None 2024 Aug 14 20:44:22.935939 sonic ERR featured: Feature 'swss.service' failed to be enabled and started 2024 Aug 14 20:45:03.763367 sonic ERR monit[10175]: Unix socket /var/run/monit.sock connection error -- No such file or directory 2024 Aug 14 20:46:03.763162 sonic ERR monit[10683]: Unix socket /var/run/monit.sock connection error -- No such file or directory 2024 Aug 14 20:46:38.967336 sonic ERR systemctl[12041]: Failed to start dhcp_relay.service: Unit dhcp_relay.service is masked. 2024 Aug 14 20:47:05.642241 sonic ERR snmp#snmp-subagent [ax_interface] ERROR: MIBUpdater.start() caught an unexpected exception during update_data()#012Traceback (most recent call last):#012 File "/usr/local/lib/python3.11/dist-packages/ax_interface/mib.py", line 42, in start#012 self.reinit_data()#012 File "/usr/local/lib/python3.11/dist-packages/sonic_ax_impl/mibs/ietf/rfc2737.py", line 322, in reinit_data#012 updater.reinit_data()#012 File "/usr/local/lib/python3.11/dist-packages/sonic_ax_impl/mibs/ietf/rfc2737.py", line 663, in reinit_data#012 self._update_entity_cache(name)#012 File "/usr/local/lib/python3.11/dist-packages/sonic_ax_impl/mibs/ietf/rfc2737.py", line 960, in _update_entity_cache#012 fan_position = int(fan_position)#012 ^^^^^^^^^^^^^^^^^#012ValueError: invalid literal for int() with base 10: '' 2024 Aug 14 20:47:12.647078 sonic ERR snmp#snmp-subagent [ax_interface] ERROR: MIBUpdater.start() caught an unexpected exception during update_data()#012Traceback (most recent call last):#012 File "/usr/local/lib/python3.11/dist-packages/ax_interface/mib.py", line 42, in start#012 self.reinit_data()#012 File "/usr/local/lib/python3.11/dist-packages/sonic_ax_impl/mibs/ietf/rfc2737.py", line 322, in reinit_data#012 updater.reinit_data()#012 File "/usr/local/lib/python3.11/dist-packages/sonic_ax_impl/mibs/ietf/rfc2737.py", line 663, in reinit_data#012 self._update_entity_cache(name)#012 File "/usr/local/lib/python3.11/dist-packages/sonic_ax_impl/mibs/ietf/rfc2737.py", line 960, in _update_entity_cache#012 fan_position = int(fan_position)#012 ^^^^^^^^^^^^^^^^^#012ValueError: invalid literal for int() with base 10: '' 2024 Aug 14 20:47:57.836504 sonic ERR snmp#snmp-subagent message repeated 8 times: [ [ax_interface] ERROR: MIBUpdater.start() caught an unexpected exception during update_data()#012Traceback (most recent call last):#012 File "/usr/local/lib/python3.11/dist-packages/ax_interface/mib.py", line 42, in start#012 self.reinit_data()#012 File "/usr/local/lib/python3.11/dist-packages/sonic_ax_impl/mibs/ietf/rfc2737.py", line 322, in reinit_data#012 updater.reinit_data()#012 File "/usr/local/lib/python3.11/dist-packages/sonic_ax_impl/mibs/ietf/rfc2737.py", line 663, in reinit_data#012 self._update_entity_cache(name)#012 File "/usr/local/lib/python3.11/dist-packages/sonic_ax_impl/mibs/ietf/rfc2737.py", line 960, in _update_entity_cache#012 fan_position = int(fan_position)#012 ^^^^^^^^^^^^^^^^^#012ValueError: invalid literal for int() with base 10: ] 2024 Aug 14 20:48:10.490613 sonic ERR systemctl[15455]: Failed to start dhcp_relay.service: Unit dhcp_relay.service is masked. 2024 Aug 14 20:49:41.606824 sonic ERR systemctl[18579]: Failed to start dhcp_relay.service: Unit dhcp_relay.service is masked. root@sonic:/var/log#

adyeung commented 1 week ago

@jeff-yin pls have someone from DELL to take a look

jeff-yin commented 1 week ago

Can someone assign this to @arunlk-dell ? I don't seem to have privileges to assign issues.

adyeung commented 1 week ago

@jeff-yin probably only the repo maintainer can assign as a formality, Arun can still submit a code PR fix and link with this, the merge of the fix will imply closing of the issue

prsunny commented 5 days ago

@arunlk-dell is not listed in assignee list. I cannot assig. @jeff-yin , i'll assign this to you for now