Open q131172019 opened 2 years ago
thanks for filing this issue to track this issue. this is due to the fact that distributor allocates machines in slices, so it is not exactly the number of machines the client requested, it is a bit over per the size of the slices being allocated to the client. which can be ~30 or so,
evaluate first step for 930 for cost.
In test for 500K nodes / 2 regions / 20 schedulers / 25K nodes per scheduler, the first 19 schedulers are successfully allocated with requested machines greater than 25k nodes due to overhead so that the remaining nodes are less than 25K. The result is the 20th scheduler is not allocated with 25k requested machines due to "Not enough hosts"