CloudSnorkel / cdk-github-runners

CDK constructs for self-hosted GitHub Actions runners
https://constructs.dev/packages/@cloudsnorkel/cdk-github-runners/
Apache License 2.0
255 stars 37 forks source link

Ec2 has not been terminated #515

Closed pharindoko closed 3 weeks ago

pharindoko commented 2 months ago

Hey,

I have this rare case that an ec2 is still available for days.

aws console: image image

system-log image

Anyone had the same issue ?

kichik commented 2 months ago

Anything of interest in /var/log/cfn-cmd.log, /var/log/cfn-cmd-init.log and the output of dmesg?

pharindoko commented 2 months ago

Anything of interest in /var/log/cfn-cmd.log, /var/log/cfn-cmd-init.log and the output of dmesg?

Not really - could I see some of that stuff in the cloudwatch logs. I'm unable to connect to the instance via ssm.

kichik commented 2 months ago

Those logs do not go in CloudWatch.

It's strange that the step function completed but the system log doesn't show the aws stepfunctions send-task-success call. Whenever I see something like this, I immediately assume OOM. That's why I asked for dmesg. But it's very strange for that to happen so fast that the log would only show one heartbeat (should be one every minute) and nothing else. Even the runner log itself seems truncate. Is it just as terse in CloudWatch?

pharindoko commented 2 months ago

I'll check what I can find in cloudwatch. It's a super rare case but appeared now twice in one week. Maybe I need an additional watchdog. This is why I'm asking if anyone else had that problem.

pharindoko commented 3 weeks ago

Related to #537