Open justindthomas opened 1 year ago
Is it possible that this PR might address this? https://github.com/sonic-net/sonic-buildimage/pull/17044
I re-added the management VRF settings to see if that PR I mentioned above solved the issue. It does not.
2023-11-21 23:42:04,907 - supervisord_dependent_startup - [INFO ] New event: Service snmpd went from BACKOFF to STARTING
2023-11-21 23:42:04,940 - supervisord_dependent_startup - [INFO ] Services:
2023-11-21 23:42:04,954 - supervisord_dependent_startup - [INFO ] - rsyslogd RUNNING dependent_startup: True priority: 1
2023-11-21 23:42:04,970 - supervisord_dependent_startup - [INFO ] - start EXITED dependent_startup: False wait_for: 'rsyslogd:RUNNING' priority: 1
2023-11-21 23:42:04,986 - supervisord_dependent_startup - [INFO ] - containercfgd RUNNING dependent_startup: True wait_for: 'rsyslogd:RUNNING' priority: 99
2023-11-21 23:42:04,992 - supervisord_dependent_startup - [INFO ] - snmpd STARTING dependent_startup: True wait_for: 'start:EXITED' priority: 3
2023-11-21 23:42:05,022 - supervisord_dependent_startup - [INFO ] - snmp-subagent STOPPED dependent_startup: True wait_for: 'snmpd:RUNNING' priority: 4
2023-11-21 23:42:05,044 - supervisord_dependent_startup - [INFO ] Services not yet running (2): snmpd, snmp-subagent
2023-11-21 23:42:06,210 - supervisord_dependent_startup - [INFO ]
2023-11-21 23:42:06,211 - supervisord_dependent_startup - [INFO ] New event: Service snmpd went from STARTING to BACKOFF
2023-11-21 23:42:06,230 - supervisord_dependent_startup - [INFO ] Services:
2023-11-21 23:42:06,233 - supervisord_dependent_startup - [INFO ] - rsyslogd RUNNING dependent_startup: True priority: 1
2023-11-21 23:42:06,246 - supervisord_dependent_startup - [INFO ] - start EXITED dependent_startup: False wait_for: 'rsyslogd:RUNNING' priority: 1
2023-11-21 23:42:06,253 - supervisord_dependent_startup - [INFO ] - containercfgd RUNNING dependent_startup: True wait_for: 'rsyslogd:RUNNING' priority: 99
2023-11-21 23:42:06,262 - supervisord_dependent_startup - [INFO ] - snmpd FATAL dependent_startup: True wait_for: 'start:EXITED' priority: 3
2023-11-21 23:42:06,264 - supervisord_dependent_startup - [INFO ] - snmp-subagent STOPPED dependent_startup: True wait_for: 'snmpd:RUNNING' priority: 4
2023-11-21 23:42:06,272 - supervisord_dependent_startup - [INFO ] Services not yet running (2): snmpd, snmp-subagent
2023-11-21 23:42:06,274 - supervisord_dependent_startup - [INFO ]
2023-11-21 23:42:06,274 - supervisord_dependent_startup - [INFO ] New event: Service snmpd went from BACKOFF to FATAL
2023-11-21 23:42:06,294 - supervisord_dependent_startup - [INFO ] Services:
2023-11-21 23:42:06,297 - supervisord_dependent_startup - [INFO ] - rsyslogd RUNNING dependent_startup: True priority: 1
2023-11-21 23:42:06,302 - supervisord_dependent_startup - [INFO ] - start EXITED dependent_startup: False wait_for: 'rsyslogd:RUNNING' priority: 1
2023-11-21 23:42:06,305 - supervisord_dependent_startup - [INFO ] - containercfgd RUNNING dependent_startup: True wait_for: 'rsyslogd:RUNNING' priority: 99
2023-11-21 23:42:06,310 - supervisord_dependent_startup - [INFO ] - snmpd FATAL dependent_startup: True wait_for: 'start:EXITED' priority: 3
2023-11-21 23:42:06,312 - supervisord_dependent_startup - [INFO ] - snmp-subagent STOPPED dependent_startup: True wait_for: 'snmpd:RUNNING' priority: 4
2023-11-21 23:42:06,324 - supervisord_dependent_startup - [INFO ] Services not yet running (1): snmp-subagent
jdt@sonic:~$ sudo show system-health summary
System status summary
System status LED blink_yellow
Services:
Status: Not OK
Not Running: container_checker, telemetry, snmp:snmpd, snmp:snmp-subagent
Hardware:
Status: OK
The telemetry error is new, but unrelated to the management VRF (AFAIK) and I think that's being addressed in a separate issue.
This may be related to https://github.com/sonic-net/sonic-buildimage/issues/16187
Description
Using a Dell N3248TE-ON, I configured the management interface using the
mgmt
VRF. The SNMP container stopped working after that (although I didn't notice it immediately).Steps to reproduce the issue:
Describe the results you received:
I only have one PSU, so you can ignore that message below.
...in the SNMP container. 10.200.0.2 is the IP address of my eth0 interface in the mgmt VRF.
Describe the results you expected:
The SNMP container to continue operating.
Output of
show version
:I use
sudo show version
because it complains aboutdmidecode
if I don't - but that's a separate issue.Output of
show techsupport
:techsupport.txt
Additional information you deem important (e.g. issue happens only occasionally):
The dump file is 60MB and exceeds GitHub's limits.