kubeflow / pipelines

Machine Learning Pipelines for Kubeflow
https://www.kubeflow.org/docs/components/pipelines/
Apache License 2.0
3.63k stars 1.63k forks source link

Kubeflow on GKE SSD_TOTAL_GB Quota Related Error #10907

Closed sanjaypsachdev closed 2 months ago

sanjaypsachdev commented 5 months ago

Discussed in https://github.com/kubeflow/pipelines/discussions/10894

Originally posted by **sanjaypsachdev** June 13, 2024 I am getting the below error related to insufficient regional SSD_TOTAL_GB quota. As this is a new free trial GCP account I can't seem to be able to request a quota increase for SSD_TOTAL_GB in my chosen region. Is there a way I can reduce the requested storage volume size and get around this error without a quota increase ? Please help ! ERROR: (gcloud.anthos.config.controller.create) unexpected error occurred while waiting for SLM operation [projects/krmapihosting-slm/locations/us-central1/operations/operation-1718220591835-61ab66805ab33-508143e7-322dac40]: errored while waiting for operation: projects/krmapihosting-slm/locations/us-central1/operations/operation-1718220591835-61ab66805ab33-508143e7-322dac40: Operation failed with error: generic::invalid_argument: terraform apply failed, error: exit status 1, stderr: Error: Error creating Cluster: googleapi: Error 403: Insufficient regional quota to satisfy request: resource "SSD_TOTAL_GB": request requires '300.0' and is short '50.0'. project has a quota of '250.0' with '250.0' available. View and manage quotas at https://console.cloud.google.com/iam-admin/quotas?usage=USED&project=gke-test-project-426203. Details: [ { "@type": "type.googleapis.com/google.rpc.DebugInfo", "detail": "INSUFFICIENT_QUOTA_REGIONAL: insufficient regional quota to satisfy request: resource \"SSD_TOTAL_GB\": request requires '300.0' and is short '50.0'. project has a quota of '250.0' with '250.0' available. View and manage quotas at https://console.cloud.google.com/iam-admin/quotas?usage=USED\u0026project=gke-test-project-426203", "stackEntries": [ "cloud/kubernetes/engine/common/error_desc.go:456 +0x26 google3/cloud/kubernetes/engine/common/errdesc.(*GKEErrorDescriptor).createErr(0xc00353efc0, {0x55c2c296dbd8, 0xc3d176dc80})", "cloud/kubernetes/engine/common/error_desc.go:379 google3/cloud/kubernetes/engine/common/errdesc.(*GKEErrorDescriptor).WithDetailCtx(...)", "cloud/kubernetes/engine/common/clients/gce/quota.go:167 +0x7e google3/cloud/kubernetes/engine/common/clients/gce/gce.validateQuota.func2()", "cloud/kubernetes/engine/common/errors.go:278 +0x5d google3/cloud/kubernetes/engine/common/errors.ConcurrencyBoundCollectFn.func1()", "cloud/kubernetes/engine/common/errors.go:276 +0x8a created by google3/cloud/kubernetes/engine/common/errors.ConcurrencyBoundCollectFn in goroutine 1822496579" ] }, { "@type": "type.googleapis.com/google.rpc.RequestInfo", "requestId": "0x2ab03cda1eb8c8f2" } ]
github-actions[bot] commented 3 months ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

github-actions[bot] commented 2 months ago

This issue has been automatically closed because it has not had recent activity. Please comment "/reopen" to reopen it.