Closed jpablo-eqx closed 2 years ago
Following the guide for Equinix Metal, the workflow is stuck at 87% and it seems to have issues with write-netplan step
+----------------------+--------------------------------------+ | FIELD NAME | VALUES | +----------------------+--------------------------------------+ | Workflow ID | e193da91-f82a-11ec-986a-0242ac120003 | | Workflow Progress | 87% | | Current Task | os-installation | | Current Action | disable-apparmor | | Current Worker | 0eba0bf8-3772-4b4a-ab9f-6ebe93b90a94 | | Current Action State | STATE_SUCCESS | +----------------------+--------------------------------------+
The workflow should complete at 100%
Here's relevant logs (I think)
tink-server_1 | {"level":"info","ts":1656563265.5266938,"caller":"server/dbserver_workflow.go:234","msg":"done getting a workflow context","service":"github.com/tinkerbell/tink","workflowID":"e193da91-f82a-11ec-986a-0242ac120003","currentWorker":"0eba0bf8-3772-4b4a-ab9f-6ebe93b90a94","currentTask":"os-installation","currentAction":"disable-apparmor","currentActionIndex":"6","currentActionState":"STATE_SUCCESS","totalNumberOfActions":8} tink-server_1 | {"level":"info","ts":1656563266.8126962,"caller":"server/dbserver_worker_workflow.go:82","msg":"received action status: STATE_RUNNING","service":"github.com/tinkerbell/tink","actionName":"write-netplan","workflowID":"e193da91-f82a-11ec-986a-0242ac120003","taskName":"os-installation"} boots_1 | {"level":"info","ts":1656563266.8126342,"caller":"syslog/receiver.go:107","msg":"host=192.168.56.43 facility=daemon severity=ERR app-name=eafa7d318d9c procid=1198 msg=\"{\\\"level\\\":\\\"info\\\",\\\"ts\\\":1656563265.1012092,\\\"caller\\\":\\\"worker/worker.go:442\\\",\\\"msg\\\":\\\"reporting Action Status\\\",\\\"service\\\":\\\"github.com/tinkerbell/tink\\\",\\\"workerID\\\":\\\"0eba0bf8-3772-4b4a-ab9f-6ebe93b90a94\\\",\\\"workflowID\\\":\\\"e193da91-f82a-11ec-986a-0242ac120003\\\",\\\"actionName\\\":\\\"write-netplan\\\",\\\"taskName\\\":\\\"os-installation\\\",\\\"workflowID\\\":\\\"e193da91-f82a-11ec-986a-0242ac120003\\\",\\\"workerID\\\":\\\"0eba0bf8-3772-4b4a-ab9f-6ebe93b90a94\\\",\\\"actionName\\\":\\\"write-netplan\\\",\\\"taskName\\\":\\\"os-installation\\\",\\\"status\\\":\\\"STATE_RUNNING\\\"}\\n\"","service":"github.com/tinkerbell/boots","pkg":"syslog"} boots_1 | {"level":"info","ts":1656563266.8141456,"caller":"syslog/receiver.go:107","msg":"host=192.168.56.43 facility=daemon severity=ERR app-name=eafa7d318d9c procid=1198 msg=\"{\\\"level\\\":\\\"error\\\",\\\"ts\\\":1656563265.1027963,\\\"caller\\\":\\\"worker/worker.go:445\\\",\\\"msg\\\":\\\"failed to report action status: rpc error: code = FailedPrecondition desc = invalid action index for workflow\\\",\\\"service\\\":\\\"github.com/tinkerbell/tink\\\",\\\"workerID\\\":\\\"0eba0bf8-3772-4b4a-ab9f-6ebe93b90a94\\\",\\\"workflowID\\\":\\\"e193da91-f82a-11ec-986a-0242ac120003\\\",\\\"actionName\\\":\\\"write-netplan\\\",\\\"taskName\\\":\\\"os-installation\\\",\\\"workflowID\\\":\\\"e193da91-f82a-11ec-986a-0242ac120003\\\",\\\"workerID\\\":\\\"0eba0bf8-3772-4b4a-ab9f-6ebe93b90a94\\\",\\\"actionName\\\":\\\"write-netplan\\\",\\\"taskName\\\":\\\"os-installation\\\",\\\"status\\\":\\\"STATE_RUNNING\\\",\\\"error\\\":\\\"failed to report action status: rpc error: code = FailedPrecondition desc = invalid action index for workflow\\\",\\\"errorVerbose\\\":\\\"rpc error: code = FailedPrecondition desc = invalid action index for workflow\\\\nfailed to report action status\\\\ngithub.com/tinkerbell/tink/cmd/tink-worker/worker.(*Worker).reportActionStatus\\\\n\\\\t/home/runner/work/tink/tink/cmd/tink-worker/worker/worker.go:445\\\\ngithub.com/tin\"","service":"github.com/tinkerbell/boots","pkg":"syslog"}
Hey @jpablo-eqx, thanks for reporting this. I believe that this PR: https://github.com/tinkerbell/tink/pull/632 should fix the issue.
Following the guide for Equinix Metal, the workflow is stuck at 87% and it seems to have issues with write-netplan step
Expected Behaviour
The workflow should complete at 100%
Current Behaviour
Here's relevant logs (I think)
Possible Solution
Steps to Reproduce (for bugs)
Context
Your Environment