After upgrade from 202305 to 202311, "telemetry" feature remain "enabled" on config_db although "telemetry" docker is no longer exist (gnmi docker replaced it).
Due to that, system is declared as "not ready":
admin@r-panther-13:~$ show system-health sysready-status
System is not ready - one or more services are not up
Steps to reproduce the issue:
Install (from ONIE) 202305 SONiC image. (SONiC.202305.555792-561bb5420)
Make sure "telemetry" feature is enabled, run "config save -y" to have it saved on config_db.json
Install 202311 image ( SONiC.202311.555802-ef65c9653) and reboot
After system is up Run command: "sudo show system-health detail" and se that telemetry service appear as "Not OK".
Describe the results you received:
admin@r-panther-13:~$ show feature status
Feature State AutoRestart SetOwner
-------------- --------------- -------------- ----------
bgp enabled enabled
database always_enabled always_enabled
dhcp_relay disabled enabled local
eventd enabled enabled
gnmi enabled enabled
lldp enabled enabled
macsec disabled enabled local
mgmt-framework enabled enabled
mux always_disabled enabled
nat disabled enabled
pmon enabled enabled
radv enabled enabled
sflow disabled enabled
snmp enabled enabled
swss enabled enabled
syncd enabled enabled
teamd enabled enabled
telemetry enabled enabled
admin@r-panther-13:~$ redis-cli -n 4 hgetall "FEATURE|telemetry"
1) "auto_restart"
2) "enabled"
3) "delayed"
4) "True"
5) "has_global_scope"
6) "True"
7) "has_per_asic_scope"
8) "False"
9) "high_mem_alert"
10) "disabled"
11) "state"
12) "enabled"
13) "support_syslog_rate_limit"
14) "true"
admin@r-panther-13:~$ show system-health sysready-status
System is not ready - one or more services are not up
Service-Name Service-Status App-Ready-Status Down-Reason
---------------------- ---------------- ------------------ -------------
auditd OK OK -
bgp OK OK -
caclmgrd OK OK -
config-chassisdb OK OK -
config-setup OK OK -
containerd OK OK -
cron OK OK -
database OK OK -
determine-reboot-cause Starting Starting -
docker OK OK -
eventd OK OK -
gnmi OK OK -
hw-management OK OK -
hw-management-tc OK OK -
kdump-tools OK OK -
lldp OK OK -
mgmt-framework OK OK -
netfilter-persistent OK OK -
ntp OK OK -
nv-syncd-shared OK OK -
pmon OK OK -
procdockerstatsd OK OK -
radv OK OK -
ras-mc-ctl OK OK -
rsyslog OK OK -
smartmontools OK OK -
snmp OK OK -
ssh OK OK -
swss OK OK -
syncd OK OK -
sysstat OK OK -
teamd OK OK -
admin@r-panther-13:~$ sudo show system-health detail
System status summary
System status LED red
Services:
Status: Not OK
Not Running: telemetry
Hardware:
Status: OK
System services and devices monitor list
Name Status Type
--------------------- -------- ----------
telemetry Not OK Service
sonic OK System
rsyslog OK Process
root-overlay OK Filesystem
var-log OK Filesystem
routeCheck OK Program
dualtorNeighborCheck OK Program
diskCheck OK Program
container_checker OK Program
vnetRouteCheck OK Program
memory_check OK Program
container_memory_snmp OK Program
container_memory_gnmi OK Program
container_eventd OK Program
eventd:eventd OK Process
database:redis OK Process
syncd:syncd OK Process
bgp:zebra OK Process
bgp:staticd OK Process
bgp:bgpd OK Process
bgp:fpmsyncd OK Process
bgp:bgpcfgd OK Process
teamd:teammgrd OK Process
teamd:teamsyncd OK Process
teamd:tlm_teamd OK Process
snmp:snmpd OK Process
snmp:snmp-subagent OK Process
lldp:lldpd OK Process
lldp:lldp-syncd OK Process
lldp:lldpmgrd OK Process
gnmi:gnmi-native OK Process
ASIC OK ASIC
fan1 OK Fan
fan2 OK Fan
fan3 OK Fan
fan4 OK Fan
fan5 OK Fan
fan6 OK Fan
fan7 OK Fan
fan8 OK Fan
psu1_fan1 OK Fan
psu2_fan1 OK Fan
PSU 1 OK PSU
PSU 2 OK PSU
System services and devices ignore list
Name Status Type
----------- -------- ------
psu.voltage Ignored Device
Describe the results you expected:
telemetry feature is not enabled, not exist on config_db and system health is ready:
admin@r-panther-13:~$ show system-health sysready-status
System is ready
Description
After upgrade from 202305 to 202311, "telemetry" feature remain "enabled" on config_db although "telemetry" docker is no longer exist (gnmi docker replaced it). Due to that, system is declared as "not ready":
Steps to reproduce the issue:
Describe the results you received:
Describe the results you expected:
telemetry feature is not enabled, not exist on config_db and system health is ready:
Output of
show version
:Before upgrade:
After upgrade:
Output of
show techsupport
:Additional information you deem important (e.g. issue happens only occasionally):