uselagoon / build-deploy-tool

Tool to generate build resources
2 stars 5 forks source link

Storage calculator pod causes Lagoon build failure #293

Closed smlx closed 4 months ago

smlx commented 6 months ago

Consider this series of events:

  1. Storage calculator pod starts running to check the storage of a service with a persistent storage volume. e.g. solr.
  2. A Lagoon build is triggered, causing the solr pod to be redeployed.
  3. The solr pod is scheduled onto a different Kubernetes node than the storage calculator.

The solr pod will now be stuck in ContainerCreating because it cannot bind the RWO volume, which is already attached to the storage calculator pod.

This will cause a Lagoon build failure that the customer cannot influence. In fact, all subsequent builds will fail until one of the following occurs:

Here's a sceenshot of this problem occuring. The storage calculator has been running (and blocking any Lagoon builds) for over two hours at this point: screenshot_2023-12-18-102634