Open sboschman opened 7 years ago
Given that all errors happened when you were using docker exec
and trying to join cpu, cpuacct
group, the only possibility I can think of is somehow the process got PF_NO_SETAFFINITY
set (usually not possibly in userspace) or the process was set to be an RT process without rt_runtime allocated in the cgroup.
does this relate to https://github.com/opencontainers/runc/issues/1884
I have same error text
"note": "Liveness probe failed: OCI runtime exec failed: exec failed: container_linux.go:348: starting container process caused \"process_linux.go:90: adding pid 27257 to cgrou
ps caused \\\"failed to write 27257 to cgroup.procs: write /sys/fs/cgroup/cpu,cpuacct/kubepods/burstable/poda16dca42-8cfc-11e9-8753-767ef6f517db/443e19668182ba1351c93af648fad2f8
b839990567d5fd4c612c152800888301/cgroup.procs: invalid argument\\\"\": unknown\r\n",
"type": "Warning",
readiness check:
readinessProbe:
exec:
command:
- /bin/sh
- -c
- redis-cli
- -h
- ${POD_IP}
- -p
- "19000"
- ping
failureThreshold: 3
initialDelaySeconds: 5
periodSeconds: 5
successThreshold: 1
timeoutSeconds: 1
kubernetes v1.14.1 os: CentOS Linux release 7.4.1708 (Core) kernel: 4.14.15-1.el7.elrepo.x86_64 docker: 18.06.2-ce (API version: 1.38 (minimum version 1.12)
@chinglinwen Similar situation like you.
Readiness probe failed: rpc error: code = 2 desc = oci runtime error: exec failed: container_linux.go:262: starting container process caused "process_linux.go:86: adding pid 16166 to cgroups caused \"failed to write 16166 to cgroup.procs: write /sys/fs/cgroup/cpu,cpuacct/kubepods.slice/kubepods-burstable.slice/kubepods-burstable-pod6e7e876e_9957_11e9_a845_00163e08cd06.slice/docker-941ddc07fc84ba668df4821403a6b051c85aad4cf6c64153aae0e9a0977d943d.scope/cgroup.procs: invalid argument\
Kernel Version: 3.10.0-693.2.2.el7.x86_64
OS Image: CentOS Linux 7 (Core)
Operating System: linux
Architecture: amd64
Container Runtime Version: docker://17.6.2
Kubelet Version: v1.12.6-aliyun.1
Kube-Proxy Version: v1.12.6-aliyun.1
I face the same problem running on kops -1.11.7
Liveness probe failed: rpc error: code = 2 desc = oci runtime error: exec failed: container_linux.go:247: starting container process caused "process_linux.go:87: adding pid 27268 to cgroups caused \"failed to write 27268 to cgroup.procs: write /sys/fs/cgroup/cpu,cpuacct/kubepods/burstable/pod612392f6-a439-11e9-9830-0e60496b67de/958ab969a162d91c9e58cb9e84db295083dfb3e4aa833e7575d3d042bffce720/cgroup.procs: invalid argument\""
Normal Killing 20m (x9 over 83d) kubelet,
any updates on this issue ?
@ilyesAj do you see this, too? If yes, can you peek into the kernel logs (dmesg
) and see if there's anything from the OOM killer. I suspect this is a race between runc trying to start exec and the kernel killing the exec'ed process.
On our Jenkins CI infrastructure we run Maven builds inside a Docker container. Unfortunately once in a while the build container crashes during the execution of the Maven build with a failure writing a pid to the cgroup.proc file.
I assume the error is thrown from https://github.com/opencontainers/runc/blob/v1.0.0-rc2/libcontainer/cgroups/utils.go#L422 , which boils down to https://github.com/golang/go/blob/master/src/io/ioutil/ioutil.go#L76 and https://github.com/golang/go/blob/master/src/os/file.go#L139