intel / cri-resource-manager

Kubernetes Container Runtime Interface proxy service with hardware resource aware workload placement policies
Apache License 2.0
175 stars 56 forks source link

got /var/run/cri-resmgr/cri-resmgr-agent.sock no such file issue #1094

Open jongwu opened 6 months ago

jongwu commented 6 months ago

My cri-resmgr version is cri-resmgr version information:

I always get journal log related with cri-resmgr:

cri-resmgr[196816]: W: [ grpc ] [core]grpc: addrConn.createTransport failed to connect to {/var/run/cri-resmgr/cri-resmgr-agent.sock /var/run/cri-resmgr/cri-resmgr-agent.sock 0 }. Err: connection error: desc = "transport: Error while dialing dial unix /var/run/cri-resmgr/cri-resmgr-agent.sock: connect: no such file or directory". Reconnecting...

Yeah, there is really no that agent socket and I have never saw it ever. I think cri-resmgr has not created it. And cri-resmgr doesn't work. When I start it with topolgy-aware plocy, the pod won't bind to numa node.

How to resolve this?