hashicorp / terraform-provider-google

Terraform Provider for Google Cloud Platform
https://registry.terraform.io/providers/hashicorp/google/latest/docs
Mozilla Public License 2.0

NVIDIA MPS not available under gpu_sharing_strategy field of google_container_cluster && google_container_node_pool resources #17852

Open whyevenquestion1t opened 7 months ago

whyevenquestion1t commented 7 months ago

Description

GCP has an NVIDIA Multi-Process Service (MPS) feature (https://cloud.google.com/kubernetes-engine/docs/how-to/nvidia-mps-gpus#deploy) that is configured by setting gpu-sharing-strategy=mps through the gcloud CLI. Unfortunately, that seems to be the only way to deploy it right now. I would like to be able to deploy a GKE cluster and configure a node pool with MPS through Terraform.

New or Affected Resource(s)

google_container_cluster
google_container_node_pool

Potential Terraform Configuration

gpu_sharing_config {
  gpu_sharing_strategy       = "MPS"
  max_shared_clients_per_gpu = var.max_shared_clients_per_gpu
}

References

https://cloud.google.com/kubernetes-engine/docs/how-to/nvidia-mps-gpus#enable-node-pool

b/335463855

melinath commented 6 months ago

This is already supported as gpu_sharing_config, which takes arbitrary string values - the docs just need to be updated.

whyevenquestion1t commented 6 months ago

> This is already supported as gpu_sharing_config, which takes arbitrary string values - the docs just need to be updated.

Thank you! Setting gpu_sharing_strategy = "mps" worked for me.
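
For anyone landing here later, a minimal sketch of how this might look in a full node pool definition. The gpu_sharing_config block sits inside guest_accelerator under node_config. The resource names, location, machine type, accelerator type, and the default of 3 clients are placeholders I made up for illustration, and the lowercase "mps" value is simply the one reported to work above, so check it against your provider version.

```hcl
# Hypothetical variable; the default value is only an example.
variable "max_shared_clients_per_gpu" {
  type    = number
  default = 3
}

# Assumes a cluster resource named google_container_cluster.primary
# is defined elsewhere in the configuration (placeholder name).
resource "google_container_node_pool" "gpu_mps" {
  name       = "gpu-mps-pool"   # placeholder
  cluster    = google_container_cluster.primary.name
  location   = "us-central1-a"  # placeholder
  node_count = 1

  node_config {
    machine_type = "n1-standard-4" # placeholder

    guest_accelerator {
      type  = "nvidia-tesla-t4" # placeholder accelerator type
      count = 1

      gpu_sharing_config {
        # Value taken from the comment above; not listed in the docs
        # at the time of this thread.
        gpu_sharing_strategy       = "mps"
        max_shared_clients_per_gpu = var.max_shared_clients_per_gpu
      }
    }
  }
}
```

The same guest_accelerator block should also be usable in the node_config of google_container_cluster, since both resources share that schema.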