microsoft / azure_arc

Automated Azure Arc, Edge, and Platform environments
https://aka.ms/ArcJumpstart
Creative Commons Attribution 4.0 International
734 stars 535 forks source link

VM has reported a failure when processing extension 'Bootstrap' (publisher 'Microsoft.Compute' and type 'CustomScriptExtension') #2503

Closed potejasw closed 2 months ago

potejasw commented 3 months ago

Is your issue related to a Jumpstart scenario, HCIBox

Describe the issue or the bug

To Reproduce

Environment summary

azure-cli 2.56.0 *

core 2.56.0 * telemetry 1.1.0

Extensions: init 0.1.0

Dependencies: msal 1.24.0b2 azure-mgmt-resource 23.1.0b2

Have you looked at the Troubleshooting and Logs section?

Screenshots

Additional context

sebassem commented 3 months ago

can you share more information, screenshots on the error ? Have you tried a redeployment ?

potejasw commented 3 months ago

Yes, I have tried the redeployment its failed with same error. image

sebassem commented 3 months ago

this looks like a problem with the service principal credentials. Please double check that you have provided the right SPN credentials with the right permissions. If all looks good please RDP into the machine and share the bootstrap.log

potejasw commented 3 months ago

I tried to build the HCIBox from the begging as a fresh deployment. Again its failed with same error. This time I made sure provided the SPN ID and details.

image image image

VM has deployed and I don't see the Bootstrap log under this location: C:\HCIBox\Logs\Bootstrap.log image

image

sebassem commented 3 months ago

can you share your parameters file with obfuscating the secrets ?

potejasw commented 3 months ago

main.parameters.json

Attached.

sebassem commented 3 months ago

When did you clone the repo? I assume it's an old copy since there are some parameters not available in your file. Also please double check in the main.bicep file that the githubAccount is set to microsoft and githubBranch is set to main. If that's the case, please perform a new clone

potejasw commented 3 months ago

I deleted older clone and created the new clone. Tried to redeploy and its failed with below error.

image

Here is the latest json file.

main.parameters.json

This is on Azure portal: image

sebassem commented 3 months ago

have you changed the property imageReference in the host.bicep file ?

potejasw commented 3 months ago

No, I haven't touched that.

image

sebassem commented 3 months ago

@dkirby-ms any thoughts on this ?

dkirby-ms commented 3 months ago

@potejasw you need to revoke/rotate your SPN secret immediately if you havent already.

potejasw commented 3 months ago

Yes, Wiped out all RG and starting the new build.

sebassem commented 2 months ago

Hello @potejasw , what is your current status?

potejasw commented 2 months ago

@sebassem, Thank you for the follow up.

In the last week, I have wiped out and build the HCI box couple of times and that worked. Now I have stable lab deployed. I have to build the cluster now.

sebassem commented 2 months ago

Thanks for the update, glad you got it working. Will close the issue now as resolved.