nforgeio / neonKUBE

Public NeonKUBE Kubernetes distribution related projects
https://neonkube.io
Apache License 2.0
80 stars 13 forks source link

Multi-node cluster setup failing: wait for system database timeout #1286

Closed jefflill closed 2 years ago

jefflill commented 3 years ago

Setup fails when building the neonCLOUD/Devops/jefflill/home/small-hyperv.cluster.yaml cluster:

image

This cluster consists of 1 master and three workers and does not deploy any cStor devices. I tried restarting setup and that failed again the same way. There's a decent chance that this will impact @granth7.

Here are the cluster setup logs:

logs.zip

from cluster.log:

STATUS: *** ERROR: [TimeoutException]: Timeout waiting for the predicate to return TRUE.

*** ERROR: [TimeoutException]: Timeout waiting for the predicate to return TRUE.
*** STACK:
   at Neon.Common.NeonHelper.WaitForAsync(Func`1 predicate, TimeSpan timeout, Nullable`1 pollInterval, String timeoutMessage) in C:\src\neonKUBE\Lib\Neon.Common\Helper\NeonHelper.Misc.cs:line 501
   at Neon.Kube.KubeSetup.WaitForStatefulSetAsync(ISetupController controller, String namespace, String name, String labelSelector, String fieldSelector) in C:\src\neonKUBE\Lib\Neon.Kube.Setup\KubeSetup.Operations.cs:line 3445
   at Neon.Common.NeonHelper.WaitAllAsync(IEnumerable`1 tasks) in C:\src\neonKUBE\Lib\Neon.Common\Helper\NeonHelper.Misc.cs:line 620
   at Neon.Kube.KubeSetup.<>c__DisplayClass99_0.<<InstallSystemDbAsync>b__3>d.MoveNext() in C:\src\neonKUBE\Lib\Neon.Kube.Setup\KubeSetup.Operations.cs:line 3345
--- End of stack trace from previous location ---
   at Neon.Kube.NodeSshProxy`1.InvokeIdempotentAsync(String actionId, Func`1 action) in C:\src\neonKUBE\Lib\Neon.Kube\Proxies\NodeSshProxy.cs:line 528
   at Neon.Kube.KubeSetup.InstallSystemDbAsync(ISetupController controller, NodeSshProxy`1 master) in C:\src\neonKUBE\Lib\Neon.Kube.Setup\KubeSetup.Operations.cs:line 3340
   at Neon.Kube.KubeSetup.SetupClusterAsync(ISetupController controller, Int32 maxParallel) in C:\src\neonKUBE\Lib\Neon.Kube.Setup\KubeSetup.Operations.cs:line 290
   at Neon.Kube.SetupController`1.<>c__DisplayClass30_0.<<ExecuteStep>b__6>d.MoveNext() in C:\src\neonKUBE\Lib\Neon.Kube\SetupController\SetupController.cs:line 888
jefflill commented 3 years ago

Here are the deployments:

image

...and here are the pods:

image

jefflill commented 3 years ago

I created the neonKUBE/storage branch with my changes. Here's the commit:

https://github.com/nforgeio/neonKUBE/commit/a15755225276cb4ce0c8853284eb2880685c6acd

I don't think we need to create a neonCLOUD branch for this. We can probably merge back to master and publish the nugets when this works.