Currently, we derive tolerations for our workloads only from the first node pool in the list of the appropriate node pools.
So if a second node pool for GPU deployments is configured, its taint `nvidia/gpu=present` is not used to derive the toleration for ModelDeployment workloads. As a result, there is no way to schedule a workload on this node pool, even with the right node selector.
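
A possible direction for the fix, as a rough sketch only: collect tolerations from the taints of every matching node pool instead of just the first one. The `NodePool`, `Taint`, and `Toleration` types and the `tolerationsForPools` helper below are hypothetical, simplified stand-ins (not our actual config types or the Kubernetes `core/v1` structs), and the `NoSchedule` effect is an assumption, since the taint above only gives a key and a value.

```go
package main

import "fmt"

// Taint and Toleration are simplified stand-ins for the Kubernetes
// core/v1 types, reduced to plain strings to keep the sketch self-contained.
type Taint struct {
	Key, Value, Effect string
}

type Toleration struct {
	Key, Operator, Value, Effect string
}

// NodePool is a hypothetical config entry: a node selector plus the taints
// that are set on the nodes of that pool.
type NodePool struct {
	NodeSelector map[string]string
	Taints       []Taint
}

// tolerationsForPools walks ALL pools (not only pools[0]) and returns one
// toleration per distinct taint, preserving first-seen order.
func tolerationsForPools(pools []NodePool) []Toleration {
	seen := make(map[Toleration]bool)
	var out []Toleration
	for _, pool := range pools {
		for _, t := range pool.Taints {
			tol := Toleration{Key: t.Key, Operator: "Equal", Value: t.Value, Effect: t.Effect}
			if !seen[tol] {
				seen[tol] = true
				out = append(out, tol)
			}
		}
	}
	return out
}

func main() {
	pools := []NodePool{
		{
			NodeSelector: map[string]string{"mode": "deployment"},
			Taints:       []Taint{{Key: "dedicated", Value: "deployment", Effect: "NoSchedule"}},
		},
		{
			// Second (GPU) pool; the effect is assumed for illustration.
			NodeSelector: map[string]string{"mode": "deployment-gpu"},
			Taints:       []Taint{{Key: "nvidia/gpu", Value: "present", Effect: "NoSchedule"}},
		},
	}
	for _, tol := range tolerationsForPools(pools) {
		fmt.Printf("%+v\n", tol)
	}
}
```

With something like this, a workload would tolerate the taints of every configured pool, and the node selector would still decide which pool it actually lands on. Whether we want that, or only the tolerations of the pool matching the selected node selector, is open for discussion.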
@karbyshevds @easokol please take a look. I think this should be fixed in the 1.4 release.