radius-project / radius

Radius is a cloud-native, portable application platform that makes app development easier for teams building cloud-native apps.
https://radapp.io
Apache License 2.0
1.44k stars 88 forks source link

Deployment was failed due to `tf init` failure #7440

Open youngbupark opened 4 months ago

youngbupark commented 4 months ago

Steps to reproduce

When functional test deployed TF recipe, it failed because TF init was failed by the external issue.

In this issue, Radius tries to run tf init with k8s secret store tf state storage, but tf init step was failed because k8s API server was not able to do TLS handshake. There are numerous reasons for this problem, but for this case, adding retry for tf init might be able to mitigate the failures.

For example,

    cli.go:418: [rad]   "details": [
    cli.go:418: [rad]     {
    cli.go:418: [rad]       "code": "ResourceDeploymentFailure",
    cli.go:418: [rad]       "message": "Failed",
    cli.go:418: [rad]       "target": "/planes/radius/local/resourceGroups/kind-radius/providers/Applications.Core/extenders/corerp-resources-terraform-azstorage",
    cli.go:418: [rad]       "details": [
    cli.go:418: [rad]         {
    cli.go:418: [rad]           "code": "RecipeDeploymentFailed",
    cli.go:418: [rad]           "message": "terraform init failure: exit status 1\n\nError: Failed to get existing workspaces: Get \"https://172.16.0.1:443/api/v1/namespaces/radius-system/secrets?labelSelector=tfstate%3Dtrue\": net/http: TLS handshake timeout\n\n\n"
    cli.go:418: [rad]         }
    cli.go:418: [rad]       ]
    cli.go:418: [rad]     },

Observed behavior

No response

Desired behavior

No error

Workaround

No response

rad Version

edge

Operating system

No response

Additional context

No response

Would you like to support us?

AB#11703

radius-triage-bot[bot] commented 4 months ago

:wave: @youngbupark Thanks for filing this bug report.

A project maintainer will review this report and get back to you soon. If you'd like immediate help troubleshooting, please visit our Discord server.

For more information on our triage process please visit our triage overview

radius-triage-bot[bot] commented 4 months ago

:+1: We've reviewed this issue and have agreed to add it to our backlog. Please subscribe to this issue for notifications, we'll provide updates when we pick it up.

We also welcome community contributions! If you would like to pick this item up sooner and submit a pull request, please visit our contribution guidelines and assign this to yourself by commenting "/assign" on this issue.

For more information on our triage process please visit our triage overview

sylvainsf commented 4 months ago

This has recurred 3-4 times in the past few months so could be a candidate for retry.