oxidecomputer / console

Oxide Web Console
https://console-preview.oxide.computer
Mozilla Public License 2.0
133 stars 10 forks source link

Allow failed instances to be stopped #2468

Closed david-crespo closed 4 days ago

david-crespo commented 1 week ago

Also rebooted maybe?

https://github.com/oxidecomputer/omicron/pull/6652

hawkw commented 1 week ago

Also rebooted maybe?

I believe Failed instances can be restarted by PUT /v1/instances/{id}/start, but not rebooted by PUT /v1/instances/{id}/reboot. The "reboot" API doesn't start a stopped/failed instance on a new VMM, but instead instructs the existing VMM to reboot the guest (as though the guest was power-cycled, essentially). That doesn't work when there is no VMM --- the instance will have to be started first.

charliepark commented 1 week ago

2481 looks to be a duplicate of this, though I don't know that I have enough context on the rebooting conversation above to know if it's an exhaustive duplicate or only a partial duplicate.

hawkw commented 1 week ago

@charliepark I think #2481 is definitely a duplicate of this. Re: rebooting, we should not allow rebooting a Failed instance, since that will just result in Nexus returning an error anyway. I'd probably keep this issue over #2481 since it has that additional context.

hawkw commented 1 week ago

Oh, if #2482 closes #2481, it should definitely also close this --- the rebooting thing is just kinda a red herring.

charliepark commented 1 week ago

Solid; will update #2482 and close #2481.

askfongjojo commented 1 week ago

Sorry about the dup. I should have checked before filing it. 😓

charliepark commented 1 week ago

No worries! It was actually seeing your issue that prompted my working on the fix, so it all worked out. 🙂