Open melvyn-apryl opened 2 years ago
I suggest you check the /var/log/eb-engine.log
in the EC2 instance associated to check the error in more detail. Probably you included a change that is breaking your application.
I know the cause. But the problem is that this message was sent by AWS:
2022-06-28 18:18:05 UTC+0200 ERROR Failed to deploy application.
΅But was not processed at this line:
if (ev.Message.match(/Failed to deploy application/)) {
And so the action never terminates, which means the 120 seconds to prevent this kind of thing from happening cannot be trusted.
this happened to me, i set a git action timeout to prevent it, not great though
https://stackoverflow.com/a/59076067/1869299
my-job: runs-on: ubuntu-latest timeout-minutes: 30
This is v16, but looking at the code, this can still happen with v20. The timeout is only checked if status is ready and version labels match, the other branches have no timeout. In this case, status is ready, but version labels seem to not match or more likely the deployment failed message was not caught. Log from action:
And eb environment:
So the last of the 4 in the same second was missed here and then the loop is endless. The setting
wait_for_environment_recovery: 120
is in the job: