Open elamaran11 opened 1 year ago
Thankyou @yxinchen @jiayiwang7 being on the call. Since the cluster nodes are having disk pressure, the cluster is beyond the point to be recoverable. So per your recommendation, im going ahead with crashing the cluster, recreating it with xlarge with container volume for CP with 100 GB and 2xlarge with container volume of 500 GB for DP.
Hi @elamaran11 - was this issue resolved once you increased resources on your cluster?
Hi @csplinter We are seeing this again from last couple of weeks. I want to try to upgrade to latest version of K8s in Snow and see if i see the issue again. If i dont see, i will close it. If not will reach back.
What happened:
SnowBallEdge EKS-A Upgrade fails for Instance Type Upgrade. I tried to upgrade from
large
instance to2xlarge
instance for nodes and the upgrade fails with below errorsWhat you expected to happen:
Upgrade of worker nodes to
2xlarge
instance.How to reproduce it (as minimally and precisely as possible):
Create an EKS-A Cluster on Snow with
large
instance of 3 node size for CP and DP and try to upgrade the instance type to2xlarge
for workers alone.Anything else we need to know?:
Environment: