Closed zhiminghufighting closed 3 years ago
Hi @zhiminghufighting - thanks for raising this!
Please could you paste the output of sudo kata-collect-data.sh
into this issue as that will give us more information on the issue (particularly if you could recreate it with full debug enabled). Please check the output of that script before you paste it, to ensure there is no sensitive information in it too ;)
Sure, i am trying to reproduce it on kata 1.2.0 and check if there still exist. Once issue is reproduced, i will run the .sh and paste the log to you.
/cc @egernst @GabyCT as they have both looked at this area with nginx/ab recently and may have some ideas already.
Do we have any new founds? thanks a lot!
Does it still exist when you use 1.2.0 or master?
Hi @zhiminghufighting - can you provide further details (see https://github.com/kata-containers/runtime/issues/668#issuecomment-417225142)?
I test nginx:latest with Kata runtime master on Centos: 7.4 for more than 1 hour and it works well.
I have tested nginx/ab with 1.2.0 extensively in the past. And did not see any issue. @zhiminghufighting Can you provide more details?
@caoruidong, this issue will not be observed by just running a stable time. Sometimes, kata nginx contianer run more than 12 or 24 hours, you can see nginx by "docker ps", but if you use "docker stop xxxx" to stop it, it will no response; or if you use "docker exec -ti xxxx /bin/bash" and you will fail to enter kata nginx;
This is only observed on centos 7.3 (3.10.0-862) and ubuntu 16.04. It will not be oberserved on Debian 9.
@amshinde New founds: On my centos 7.3(3.10.0-862), this issues will be observed only by running with nginx image(pull from default docker hub). No such issue can be observed when i use kata to launch centos/ubuntu/ images.
@zhiminghufighting have you tried with use_vsock = true
? https://github.com/kata-containers/documentation/blob/master/VSocks.md#system-requirements
@zhiminghufighting Are you still experiencing this issue at all? If not, this issue may be stale and should be closed.
kata will hang or stuck on running nginx on Ubuntu or centos with very high possibility
kata running on host as bare metal mode; Kata runtime version:1.1.0 & 1.2.0 Host OS: Ubuntu: 16.04 Host OS: Centos: 7.3 nginx: 1.15.1 (nginx image is pulled from public image hub of docker.io)
Expected result
nginx running in kata container will never stuck or hang
Actual result
In Centos 7.3 or Ubuntu16.04, kata container will hang or stuck after the nginx server being started more than 1 hour with very high possibility(more than 7 in 10;) "docker stop" can not stop kata container; "docker exec" can not enter kata container; "kill -9" can not kill the kata container/runtime/ process;
docker & kata runtime will not response to any cmd; need to reboot host os to restart the docker and kata;
There is never such issue if we switch runtime to runc to run the same nginx image;
Meta details
Paste the log and verify the same issue on kata 1.2.0:
Running
kata-collect-data.sh
version1.2.0 (commit 0bcb32f)
at2018-08-31.13:53:55.157847694+0800
.Runtime is
/bin/kata-runtime
.kata-env
Output of "
/bin/kata-runtime kata-env
":Runtime config files
Runtime default config files
Runtime config file contents
Config file
/etc/kata-containers/configuration.toml
not found Output of "cat "/usr/share/defaults/kata-containers/configuration.toml"
":Image details
Initrd details
No initrd
Logfiles
Runtime logs
Recent runtime problems found in system journal:
Proxy logs
Recent proxy problems found in system journal:
Shim logs
No recent shim problems found in system journal.
Container manager details
Have
docker
Docker
Output of "
docker version
":Output of "
docker info
":Output of "
systemctl show docker
":Have
kubectl
Kubernetes
Output of "
kubectl version
":Output of "
kubectl config view
":Output of "
systemctl show kubelet
":No
crio
Packages
No
dpkg
Haverpm
Output of "rpm -qa|egrep "(cc-oci-runtimecc-runtimerunv|kata-proxy|kata-runtime|kata-shim|kata-containers-image|linux-container|qemu-)"
":