adobe / aquarium-fish

Your best secure distributed heterogeneous dynamic compute resource manager for CI
Other
7 stars 3 forks source link

Repeated failures need to make label or driver disabled for the node #4

Open sparshev opened 2 years ago

sparshev commented 2 years ago

We need to have some limits on failure of the label to inform the cluster that this node can not execute the particular label or maybe the entire driver due to the fails in allocating.

Right now if it fails - it will continue to fail.