jenkinsci / kubernetes-cd-plugin

A Jenkins plugin to deploy to Kubernetes cluster
MIT License
140 stars 72 forks source link

waiting for agent to connect (30/100): jenkins-slave-7w7lp #156

Closed rranjith406 closed 3 years ago

rranjith406 commented 3 years ago

I'm facing something similar error. I've my Jenkins master and agent running on K8s pod. The agent is getting provisioned successfully, when It tried to connect to Master, it couldn't. Agent pod will wait for 100s and get terminated. Can I get some help on this..

Capture1 Agent log Capture2

This is my Master log

`2021-05-28 08:59:04.807+0000 [id=37] INFO hudson.slaves.NodeProvisioner#lambda$update$6: jenkins-slave-7w7lp provisioning successfully completed. We have now 2 computer(s) 2021-05-28 08:59:04.938+0000 [id=2793] INFO o.c.j.p.k.KubernetesLauncher#launch: Created Pod: kubernetes infra/jenkins-slave-7w7lp 2021-05-28 08:59:09.399+0000 [id=4211] INFO j.b.MultiBranchProject$BranchIndexing#run: OS Pipeline BE #202100 branch indexing action completed: SUCCESS in 8.5 sec 2021-05-28 08:59:16.024+0000 [id=2793] INFO o.c.j.p.k.KubernetesLauncher#launch: Pod is running: kubernetes infra/jenkins-slave-7w7lp 2021-05-28 08:59:46.248+0000 [id=2793] INFO o.c.j.p.k.KubernetesLauncher#launch: Waiting for agent to connect (30/100): jenkins-slave-7w7lp 2021-05-28 09:00:16.578+0000 [id=2793] INFO o.c.j.p.k.KubernetesLauncher#launch: Waiting for agent to connect (60/100): jenkins-slave-7w7lp 2021-05-28 09:00:46.782+0000 [id=2793] INFO o.c.j.p.k.KubernetesLauncher#launch: Waiting for agent to connect (90/100): jenkins-slave-7w7lp 2021-05-28 09:00:56.898+0000 [id=2793] SEVERE o.c.j.p.k.KubernetesLauncher#logLastLines: Error in provisioning; agent=KubernetesSlave name: jenkins-slave-7w7lp, template=PodTemplate{id='220600f0-4c60-4eec-9f80-a871414cfc2e', name='jenkins-slave', namespace='infra', nodeSelector='intent=apps,lifecycle=Ec2Spot', nodeUsageMode=NORMAL, volumes=[org.csanchez.jenkins.plugins.kubernetes.volumes.PersistentVolumeClaim@74f6bd9d, SecretVolume [mountPath=/mnt/keys, secretName=tmos-cert, defaultMode=256, optional=false]], containers=[ContainerTemplate{name='jnlp', image='123456789.dkr.ecr.region-1.amazonaws.com/jenkins-agent:v.1', privileged=true, alwaysPullImage=true, workingDir='/home/jenkins/agent', command='', args='', ttyEnabled=true, resourceRequestCpu='', resourceRequestMemory='', resourceRequestEphemeralStorage='', resourceLimitCpu='3000m', resourceLimitMemory='6000Mi', resourceLimitEphemeralStorage='', livenessProbe=ContainerLivenessProbe{execArgs='', timeoutSeconds=0, initialDelaySeconds=0, failureThreshold=0, periodSeconds=0, successThreshold=0}}]}. Container jnlp. Logs: about to fork child process, waiting until server is ready for connections. forked process: 11 child process started successfully, parent exiting fill mongo db ... add git key for authenticatoin and user informations ... mkdir: cannot create directory '/root/.ssh': File exists

2021-05-28 09:00:56.899+0000 [id=2793] WARNING o.c.j.p.k.KubernetesLauncher#launch: Error in provisioning; agent=KubernetesSlave name: jenkins-slave-7w7lp, template=PodTemplate{id='220600f0-4c60-4eec-9f80-a871414cfc2e', name='jenkins-slave', namespace='infra', nodeSelector='intent=apps,lifecycle=Ec2Spot', nodeUsageMode=NORMAL, volumes=[org.csanchez.jenkins.plugins.kubernetes.volumes.PersistentVolumeClaim@74f6bd9d, SecretVolume [mountPath=/mnt/keys, secretName=tmos-cert, defaultMode=256, optional=false]], containers=[ContainerTemplate{name='jnlp', image='123456789.dkr.ecr.region-1.amazonaws.com/jenkins-agent:v.1', privileged=true, alwaysPullImage=true, workingDir='/home/jenkins/agent', command='', args='', ttyEnabled=true, resourceRequestCpu='', resourceRequestMemory='', resourceRequestEphemeralStorage='', resourceLimitCpu='3000m', resourceLimitMemory='6000Mi', resourceLimitEphemeralStorage='', livenessProbe=ContainerLivenessProbe{execArgs='', timeoutSeconds=0, initialDelaySeconds=0, failureThreshold=0, periodSeconds=0, successThreshold=0}}]} java.lang.IllegalStateException: Agent is not connected after 100 seconds, status: Running at org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher.launch(KubernetesLauncher.java:244) at hudson.slaves.SlaveComputer.lambda$_connect$0(SlaveComputer.java:294) at jenkins.util.ContextResettingExecutorService$2.call(ContextResettingExecutorService.java:46) at jenkins.security.ImpersonatingExecutorService$2.call(ImpersonatingExecutorService.java:71) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748) 2021-05-28 09:00:56.899+0000 [id=2793] INFO o.c.j.p.k.KubernetesSlave#_terminate: Terminating Kubernetes instance for agent jenkins-slave-7w7lp 2021-05-28 09:00:56.989+0000 [id=2793] INFO o.c.j.p.k.KubernetesSlave#deleteSlavePod: Terminated Kubernetes instance for agent infra/jenkins-slave-7w7lp Terminated Kubernetes instance for agent infra/jenkins-slave-7w7lp 2021-05-28 09:00:56.990+0000 [id=2793] INFO o.c.j.p.k.KubernetesSlave#_terminate: Disconnected computer jenkins-slave-7w7lp Disconnected computer jenkins-slave-7w7lp `

_Originally posted by @rranjith406 in https://github.com/jenkinsci/docker-agent/issues/659