Closed genaimagician closed 9 months ago
Why did you add a GPU node pool variable in the TPU training guide?
I used it to launch GPU node pools to test out the terraform and left it as an example for users. Removed it per our offline discussion and modified gke-ml module readme. Please look at the new commit. Thanks!
I've launched GPU node pools by running the code. Here are the screenshots of launched node, logs and GPU setup i used for testing (e.g. GPU types).
https://docs.google.com/document/d/1FQB-KWGtcBKcfWUuj7U5E8HMc6gIeVSStqdcltiBBjk/edit?tab=t.0