GoogleCloudPlatform / kubeflow-distribution

Blueprints for Deploying Kubeflow on Google Cloud Platform and Anthos
Apache License 2.0
80 stars 63 forks source link

deploying management cluster error #435

Closed C-Jaewon closed 1 year ago

C-Jaewon commented 1 year ago

https://googlecloudplatform.github.io/kubeflow-gke-docs/docs/deploy/management-setup/

The same problem continues to occur in the process of proceeding with the tutorial.

The management cluster name "kubeflow-m" is valid.
gcloud services enable krmapihosting.googleapis.com \
    container.googleapis.com \
    cloudresourcemanager.googleapis.com
Operation "operations/acat.p2-174928385958-836eadbf-b770-4196-a5e4-fc4b82d2a278" finished successfully.
gcloud anthos config controller create kubeflow-m --location=asia-northeast1
Create request issued for: [kubeflow-m]
Waiting for operation [projects/kube-394803/locations/asia-northeast1/operations/operation-1691116470094-6020fbc794c2f-4fdc56cd-7a4d1b65] to complete...failed.                     
ERROR: (gcloud.anthos.config.controller.create) Operation https://krmapihosting.googleapis.com/v1/projects/kube-394803/locations/asia-northeast1/operations/operation-1691116470094-6020fbc794c2f-4fdc56cd-7a4d1b65 has not finished in 1800 seconds. The operations may still be underway remotely and may still succeed; use gcloud list and describe commands or https://console.developers.google.com/ to check resource state.
make: *** [Makefile:146: create-cluster] 오류 1
Linchin commented 1 year ago

Looks like the command to create the management cluster timed out. Could you run the command gcloud containers cluster list to see whether the cluster you created exists? It feels more like a network/authentication issue as I could not replicate the error.