Open cblmemo opened 17 hours ago
This is intentional to keep aligned with cloud behavior - we try to submit a pod and let the cluster determine if it can fit or not, just like how clouds inform us if they are out of capacity. This feature also allows users to "queue" jobs by setting a provision_timeout
in their config, which lets the pod stay pending for a while before giving up.
Currently, when the k8s cluster is fully occupied, the optimizer will still shows it as candidate. For example, in the replica resources optimization result, it select k8s as resources, but actually it launches on GCP.