canonical / sdcore-udr-k8s-operator

A Charmed Operator for SD-Core's Unified Data Repository (UDR) component.
https://charmhub.io/sdcore-udr-k8s
Apache License 2.0
0 stars 0 forks source link

Charmed-5G breaks after UDR pod crashes #30

Closed ghislainbourgeois closed 5 months ago

ghislainbourgeois commented 9 months ago

Deleting the UDR pod and waiting for it to come back in Active/Idle results in a failing simulation.

This sequence from chaos testing produced the issue:

ERROR    test_integration:test_integration.py:84 Failed after deleting udr-0
ERROR    test_integration:test_integration.py:85 Successes:
['smf-0', 'webui-0', 'webui-0', 'ausf-0']
gatici commented 9 months ago

UDR still uses old NRF IP after NRF pod restart:

nrf                                  ClusterIP      10.152.183.158   <none>        65535/TCP,29510/TCP           25h   app.kubernetes.io/name=nrf

nrf-0                           2/2     Running   4 (55m ago)    24h   10.1.146.9    gaticipc   <none>           <none>

"upf-external"}} AppFilters:{PccRuleBase:{FlowInfos:{FlowDesc:"permit out ip from any to assigned" TosTrafficClass:"IPV4" FlowDir:BIDIRECTIONAL} RuleId:"DefaultRule" Qos:{Var5qi:9 Arp:{PL:1 PC:MAY_PREEMPT PV:PREEMPTABLE}} Priority:255}}} ConfigUpdated:1
2023-11-22T14:30:21.938Z [udr] 2023-11-22T14:30:21Z [INFO][UDR][GRPC] Network Slice Name  default
2023-11-22T14:30:21.938Z [udr] 2023-11-22T14:30:21Z [INFO][UDR][GRPC] Network Slice has site name present
2023-11-22T14:30:21.938Z [udr] 2023-11-22T14:30:21Z [INFO][UDR][GRPC] Site name  demo
2023-11-22T14:30:21.938Z [udr] 2023-11-22T14:30:21Z [INFO][UDR][GRPC] Plmn mcc  208
2023-11-22T14:30:21.938Z [udr] 2023-11-22T14:30:21Z [INFO][UDR][Init] Config update DB trigger
2023-11-22T14:30:21.938Z [udr] 2023-11-22T14:30:21Z [INFO][UDR][CFG] AddEntrySmPolicyTable
2023-11-22T14:30:21.938Z [udr] 2023-11-22T14:30:21Z [INFO][UDR][CFG] collname, imsi, dnn, sst, sd :  policyData.ues.smData 208930100007487 internet 1 010203
2023-11-22T14:30:21.939Z [udr] 2023-11-22T14:30:21Z [INFO][UDR][CFG] entry in DB key   01010203
2023-11-22T14:30:21.939Z [udr] 2023-11-22T14:30:21Z [INFO][UDR][CFG] entry in DB val   {0xc000464c48 map[internet:{internet [] []   false false 0 0 false false <nil> map[] false false 0}]}
2023-11-22T14:30:21.942Z [udr] 2023-11-22T14:30:21Z [INFO][UDR][Init] added entry to sm policy table success
2023-11-22T14:30:21.943Z [udr] 2023-11-22T14:30:21Z [INFO][Config5g][GRPC] stream msg recieved
2023-11-22T14:30:21.943Z [udr] 2023-11-22T14:30:21Z [INFO][Config5g][GRPC] First time config Received  RestartCounter:1482873020 NetworkSlice:{Name:"default" Nssai:{Sst:"1" Sd:"010203"} DeviceGroup:{Name:"cows" IpDomainDetails:{Name:"pool1" DnnName:"internet" UePool:"172.250.0.0/16" DnsPrimary:"8.8.8.8" Mtu:1450 UeDnnQos:{DnnMbrUplink:2000000 DnnMbrDownlink:2000000 TrafficClass:{Name:"platinum" Qci:8 Arp:6 Pdb:300 Pelr:6}}} Imsi:"208930100007487"} Site:{SiteName:"demo" Gnb:{Name:"demo-gnb1" Tac:1} Plmn:{mcc:"208" mnc:"93"} Upf:{UpfName:"upf-external"}} AppFilters:{PccRuleBase:{FlowInfos:{FlowDesc:"permit out ip from any to assigned" TosTrafficClass:"IPV4" FlowDir:BIDIRECTIONAL} RuleId:"DefaultRule" Qos:{Var5qi:9 Arp:{PL:1 PC:MAY_PREEMPT PV:PREEMPTABLE}} Priority:255}}} ConfigUpdated:1
2023-11-22T14:30:29.619Z [udr] UDR register to NRF Error[Put "https://10.1.146.25:29510/nnrf-nfm/v1/nf-instances/f2a3fc58-5cec-4b84-a3b9-905b5f3a730c": dial tcp 10.1.146.25:29510: connect: connection timed out]
2023-11-22T14:30:29.619Z [udr] UDR register to NRF Error[Put "https://10.1.146.25:29510/nnrf-nfm/v1/nf-instances/f2a3fc58-5cec-4b84-a3b9-905b5f3a730c": dial tcp 10.1.146.25:29510: connect: connection timed out]
2023-11-22T14:32:42.743Z [udr] UDR register to NRF Error[Put "https://10.1.146.25:29510/nnrf-nfm/v1/nf-instances/f2a3fc58-5cec-4b84-a3b9-905b5f3a730c": dial tcp 10.1.146.25:29510: connect: connection timed out]
2023-11-22T14:32:42.743Z [udr] UDR register to NRF Error[Put "https://10.1.146.25:29510/nnrf-nfm/v1/nf-instances/f2a3fc58-5cec-4b84-a3b9-905b5f3a730c": dial tcp 10.1.146.25:29510: connect: connection timed out]
gruyaume commented 6 months ago

@ghislainbourgeois is this issue still valid?