issues
search
GoogleCloudPlatform
/
cluster-toolkit
Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments on Google Cloud.
Apache License 2.0
186
stars
124
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Bump golang.org/x/sys from 0.24.0 to 0.25.0
#3013
dependabot[bot]
opened
6 hours ago
0
add support for enabling tcpx/o in a3 and a3mega vm, provide script for injecting rxdm sidecar and other required components into user workload
#3012
chengcongdu
opened
2 days ago
0
Use local-ssd for enroot temp space.
#3011
samskillman
opened
2 days ago
1
adding module cache to prevent repeated module downloads during modul…
#3010
RachaelSTamakloe
opened
2 days ago
0
Add cluster deployment step to a3-megagpu-8g integration test
#3009
tpdownes
closed
3 days ago
0
Logging to BigQuery can fail if number of rows to insert is too large
#3008
fdmalone
opened
3 days ago
0
Fix Slurm tag on A3 high integration test to match v6 blueprint
#3007
tpdownes
opened
3 days ago
0
Identify tests failing on develop
#3006
annuay-google
opened
4 days ago
0
Default to zonal bulkInsert
#3005
mr0re1
opened
4 days ago
0
Release v1.39.0
#3004
alyssa-sm
opened
4 days ago
0
Add machine type availability checks
#3003
annuay-google
opened
4 days ago
2
Updating image builder for internal provider
#3002
cdunbar13
closed
3 days ago
0
Disable bootstrap GitHub actions in Spack repo before first installation
#3001
rohitramu
opened
5 days ago
0
Bump cryptography from 42.0.4 to 43.0.1 in /community/front-end/ofe
#3000
dependabot[bot]
closed
4 days ago
1
Update debian default image in chrome-remote-desktop module
#2999
harshthakkar01
closed
4 days ago
0
Revert "SlurmGCP. Do not add empty startup scripts"
#2998
harshthakkar01
closed
5 days ago
1
Revisit the Reservation Interface for GKE Blueprints
#2997
arajmane-g
closed
5 days ago
0
Bump github.com/hashicorp/hcl/v2 from 2.21.0 to 2.22.0
#2996
dependabot[bot]
closed
5 days ago
1
Bump google.golang.org/api from 0.186.0 to 0.195.0
#2995
dependabot[bot]
closed
4 days ago
3
Add machine type availability checks by zone
#2994
annuay-google
closed
4 days ago
0
Fix for cleanup script. The last input is optional
#2993
cdunbar13
closed
4 days ago
0
Catch "None" fields in slurm job datetime data for BigQuery
#2992
fdmalone
closed
4 days ago
2
Catch None fields in slurm job data.
#2991
fdmalone
closed
1 week ago
0
Revert "Add machine type availability checks to slurm-gcp-v6-nodeset"
#2990
harshthakkar01
closed
1 week ago
0
Slurm accounting data not loading to BigQuery
#2989
fdmalone
opened
1 week ago
4
SlurmGCP. Don't skip clean up of "stopped" instances
#2988
mr0re1
closed
3 days ago
0
Add enable-maintenance-reservation flag in slurm to control reservation for scheduled maintenance
#2987
harshthakkar01
closed
3 days ago
0
Disable automatic updates in daos installation script
#2986
harshthakkar01
closed
1 week ago
0
kubernetes provider added to gke-cluster module
#2985
sharabiani
closed
1 week ago
0
Bump slurm-gcp version & add enroot/pyxis test
#2984
mr0re1
closed
1 week ago
0
Add enroot/pyxis step to a3 series integration tests
#2983
tpdownes
opened
1 week ago
0
Improved serial port collection tool
#2982
cdunbar13
closed
1 week ago
0
Don't set `automaticRestart: false`
#2981
mr0re1
closed
1 week ago
0
implement kubectl-apply module
#2980
sharabiani
closed
4 days ago
0
Use sackd for the login nodes
#2979
jvilarru
opened
1 week ago
4
Prevent use of google provider 6.0 where breaking changes are in use
#2978
tpdownes
closed
1 week ago
0
Prevent use of google provider 6.0 in vm-instance
#2977
tpdownes
closed
1 week ago
0
Remove default service account
#2976
alyssa-sm
opened
1 week ago
0
Increate `ps-slurm.yaml` startup scripts timeout
#2975
mr0re1
closed
1 week ago
0
Adding new network features to branch
#2974
cdunbar13
closed
5 days ago
0
Bump google.golang.org/api from 0.186.0 to 0.194.0
#2973
dependabot[bot]
closed
1 week ago
3
SlurmGCP. Fixes & improvements around config fetching
#2972
mr0re1
closed
1 week ago
0
Expose maintenance interval as a blueprint setting for node pools in GKE
#2971
annuay-google
closed
1 week ago
2
Add integration test for Parallelstore (vm instances)
#2970
abbas1902
closed
5 days ago
3
Support named placements in GKE node pools
#2969
arajmane-g
closed
1 week ago
0
Fix local_ssd_config issue that forces node-pool recreation
#2968
sharabiani
closed
1 week ago
0
Add `NodeState` enum
#2966
mr0re1
opened
2 weeks ago
0
SlurmGCP. Do not add empty startup scripts
#2965
mr0re1
closed
1 week ago
1
Add description as a hint to "missed required variable" error
#2964
mr0re1
opened
2 weeks ago
0
Add example of using topology with pytorch
#2963
samskillman
opened
2 weeks ago
0
Next