Bug: pod restarted after containerd restarted when container IO is huge

containerd / rust-extensions

Rust crates to extend containerd

https://containerd.io

Apache License 2.0


Bug: pod restarted after containerd restarted when container IO is huge #218

Open Burning1020 opened 1 year ago

Burning1020 commented 1 year ago

We have found a bug where pods restart after containerd restarts, especially when the container's IO is very frequent. The parent process of the container process, that is, our async containerd-shim process, does not handle the container process's IO pipes carefully enough. As a result, when containerd restarts, some containers, especially those with heavy IO, receive EPIPE and exit unexpectedly.
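
For illustration only, here is a minimal sketch of the failure mode described above: a writer that keeps producing output after the reading side has gone away gets EPIPE, which Rust reports as `ErrorKind::BrokenPipe`. This is not code from the shim. It uses `UnixStream::pair()` from the standard library as a stand-in for the container's stdio pipe, and the function name `write_after_reader_closed` is hypothetical.

```rust
use std::io::{ErrorKind, Write};
use std::os::unix::net::UnixStream;

/// Hypothetical helper: writes to one end of a connected pair after the
/// other end has been closed, and returns the kind of the write error,
/// or None if the write unexpectedly succeeded.
fn write_after_reader_closed() -> Option<ErrorKind> {
    // Stand-in for the container stdout pipe: `writer` plays the container
    // process, `reader` plays whoever was consuming its output.
    let (mut writer, reader) = UnixStream::pair().expect("socketpair failed");

    // The consumer goes away, as it might while containerd restarts.
    drop(reader);

    // The container keeps writing; the kernel answers with EPIPE.
    writer
        .write_all(b"container output\n")
        .err()
        .map(|e| e.kind())
}

fn main() {
    match write_after_reader_closed() {
        Some(kind) => println!("write failed: {:?}", kind),
        None => println!("write succeeded"),
    }
}
```

A Rust binary ignores SIGPIPE by default, so the write returns an error instead of killing the process. A container process that leaves SIGPIPE at its default disposition, or that treats the write error as fatal, would exit at this point, which matches the behaviour reported here.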

Burning1020 commented 1 year ago

We need to improve how the async shim-runc crate handles the container IO pipes. I can fix this later.