Open cecheta opened 1 month ago
[!IMPORTANT] The "Needs: Triage :mag:" label must be removed once the triage process is complete!
[!TIP] For additional guidance on how to triage this issue/PR, see the BRM Issue Triage documentation.
[!WARNING] Tagging the AVM Core Team (@Azure/avm-core-team-technical-bicep) due to a module owner or contributor having not responded to this issue within 3 business days. The AVM Core Team will attempt to contact the module owners/contributors directly.
[!TIP]
- To prevent further actions to take effect, the "Status: Response Overdue 🚩" label must be removed, once this issue has been responded to.
- To avoid this rule being (re)triggered, the ""Needs: Triage :mag:" label must be removed as part of the triage process (when the issue is first responded to)!
Hey @cecheta, good catch. I think I've seen this happening in a recent APIM deployment. This should be addressed but will be challenging. For one, we need to reproduce the issue while debugging. Then, we must hope that ARM actually returns some proper error that we can interpret because more often than not, information is written to the log, but not actually returned by the cmdlet. If it turns out it does not return anything useful, we may need to resort to more drastic means and add a logic that picks up after the deployment cmdlet and always pings the deployment itself with some waiting logic (effectively pulling the deployment data every x seconds until it's done).
Would you happen to have noticed a service where this occurs somewhat consistently?
Unfortunately I haven't observed this behaviour consistently for any service
Check for previous/existing GitHub issues
Issue Type?
Bug
Description
There is an intermittent error that can occur when checking the deployment status during CI:
When this occurs, a new deployment is started, however the first deployment is actually still ongoing. The second deployment is then likely to fail because there are essentially two deployments going on at the same time.
Perhaps a retry could be added when checking the deployment status?