Open mackyanyan opened 1 year ago
We have deprecated the previous 3.2 versions. Please update to the latest SSM Agent v3.2.582.0. Please let us know if the problem persists.
The logic to sleep for 24 hours is still present in v3.2.582.0
https://github.com/aws/amazon-ssm-agent/blob/7d0a6c29e6a44004830adb2d4052e2f4f63fa9f8/core/app/credentialrefresher/credentialrefresher.go#L54 https://github.com/aws/amazon-ssm-agent/blob/7d0a6c29e6a44004830adb2d4052e2f4f63fa9f8/core/app/credentialrefresher/credentialrefresher.go#L231
@Seantonomous Got it. We'll be working on this issue.
@sluggard76 @Seantonomous Thank you for your reaction.
Unfortunately, even with the latest version, the situation remains unchanged. The root cause is that http400 errors often occur, and for each occurrence of an http400 error, we perform a recovery operation through manual restart of SSM on a daily basis. It would be desirable to implement measures such as retrying when an error occurs or retrying after a short sleep time.
same issue +1
3.2.582.0 도 같은 문제가 발생합니다.
I oversee the operation of over 200 onpremis machines with SSM We implement frequent updates. When I upgraded agent version from 3.1.1927 to 3.2.286, persistent disconnections emerged. The accompanying log records instances of these occurrences.