kubeedge / sedna

AI tookit over KubeEdge
https://sedna.readthedocs.io
Apache License 2.0
505 stars 164 forks source link

something wrong when deployed the joint_inference example in the edge side #234

Open 15926273249 opened 2 years ago

15926273249 commented 2 years ago

What happened: There is something wrong when deployed the joint_Inference example ,the worker failed in the edge node What you expected to happen:

How to reproduce it (as minimally and precisely as possible): When the service in the cloud node started, howerver, the worker failed in the edge node so that I cannot inference the video in the edge side Anything else we need to know?: When I executed "kubectl describe jointinferenceservices.sedna.io", I got: Status: Active: 1 Conditions: Last Heartbeat Time: 2021-11-09T09:54:10Z Last Transition Time: 2021-11-09T09:54:10Z Status: True Type: Running Last Heartbeat Time: 2021-11-09T09:54:10Z Last Transition Time: 2021-11-09T09:54:10Z Message: the worker of service failed Reason: workerFailed Status: True Type: Failed Failed: 1 Start Time: 2021-11-09T09:54:10Z Events:

Environment:

Sedna Version ```console $ kubectl get -n sedna deploy gm -o jsonpath='{.spec.template.spec.containers[0].image}' # kubeedge/sedna-gm:v0.4.3 $ kubectl get -n sedna ds lc -o jsonpath='{.spec.template.spec.containers[0].image}' #kubeedge/sedna-lc:v0.4.3 ```
Kubernets Version ```console $ kubectl version #Client Version: version.Info{Major:"1", Minor:"20", GitVersion:"v1.20.6", GitCommit:"8a62859e515889f07e3e3be6a1080413f17cf2c3", GitTreeState:"clean", BuildDate:"2021-04-15T03:28:42Z", GoVersion:"go1.15.10", Compiler:"gc", Platform:"linux/amd64"} Server Version: version.Info{Major:"1", Minor:"20", GitVersion:"v1.20.6", GitCommit:"8a62859e515889f07e3e3be6a1080413f17cf2c3", GitTreeState:"clean", BuildDate:"2021-04-15T03:19:55Z", GoVersion:"go1.15.10", Compiler:"gc", Platform:"linux/amd64"} $ cloudcore --version KubeEdge v1.6.1 $ edgecore --version 2021-11-11 10:27:09.400657 I | INFO: Install client plugin, protocol: rest 2021-11-11 10:27:09.400737 I | INFO: Installed service discovery plugin: edge KubeEdge v1.7.2

CloudSide Environment:

Hardware configuration ```console $ lscpu Architecture: x86_64 CPU op-mode(s): 32-bit, 64-bit Byte Order: Little Endian CPU(s): 4 On-line CPU(s) list: 0-3 Thread(s) per core: 1 Core(s) per socket: 1 Socket(s): 4 NUMA node(s): 1 Vendor ID: GenuineIntel CPU family: 6 Model: 60 Model name: Intel Core Processor (Haswell, no TSX) Stepping: 1 CPU MHz: 2399.996 BogoMIPS: 4799.99 Hypervisor vendor: KVM Virtualization type: full L1d cache: 32K L1i cache: 32K L2 cache: 4096K L3 cache: 16384K NUMA node0 CPU(s): 0-3 Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 syscall nx rdtscp lm constant_tsc rep_good nopl xtopology cpuid tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm cpuid_fault invpcid_single fsgsbase bmi1 avx2 smep bmi2 erms invpcid xsaveopt arat $ cat /etc/os-release NAME="Ubuntu" VERSION="18.04.6 LTS (Bionic Beaver)" ID=ubuntu ID_LIKE=debian PRETTY_NAME="Ubuntu 18.04.6 LTS" VERSION_ID="18.04" HOME_URL="https://www.ubuntu.com/" SUPPORT_URL="https://help.ubuntu.com/" BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/" PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy" VERSION_CODENAME=bionic UBUNTU_CODENAME=bionic $ uname -a # Linux master1 4.15.0-158-generic #166-Ubuntu SMP Fri Sep 17 19:37:52 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux
Others

EdgeSide Environment:

Hardware configuration ```console $ lscpu # paste output here ``` Architecture: x86_64 CPU op-mode(s): 32-bit, 64-bit Byte Order: Little Endian CPU(s): 4 On-line CPU(s) list: 0-3 Thread(s) per core: 2 Core(s) per socket: 2 Socket(s): 1 NUMA node(s): 1 Vendor ID: GenuineIntel CPU family: 6 Model: 134 Model name: Intel(R) Xeon(R) Platinum 8358P CPU @ 2.60GHz Stepping: 0 CPU MHz: 2593.968 BogoMIPS: 5187.93 Hypervisor vendor: KVM Virtualization type: full L1d cache: 32K L1i cache: 32K L2 cache: 4096K L3 cache: 16384K NUMA node0 CPU(s): 0-3 Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch cpuid_fault invpcid_single ssbd ibrs ibpb fsgsbase bmi1 avx2 smep bmi2 erms invpcid avx512f avx512dq rdseed adx smap avx512ifma clflushopt clwb avx512cd sha_ni avx512bw avx512vl xsaveopt xsavec xgetbv1 arat avx512vbmi umip pku ospke avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bitalg avx512_vpopcntdq la57
OS ```console $ cat /etc/os-release # paste output here ``` NAME="Ubuntu" VERSION="18.04.5 LTS (Bionic Beaver)" ID=ubuntu ID_LIKE=debian PRETTY_NAME="Ubuntu 18.04.5 LTS" VERSION_ID="18.04" HOME_URL="https://www.ubuntu.com/" SUPPORT_URL="https://help.ubuntu.com/" BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/" PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy" VERSION_CODENAME=bionic UBUNTU_CODENAME=bionic
Kernel ```console $ uname -a # paste output here ``` Linux edgenode1 4.15.0-121-generic #123-Ubuntu SMP Mon Oct 5 16:16:40 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux
Others
JoeyHwong-gk commented 2 years ago

Duplicate of #198