Closed biocyberman closed 2 years ago
It's difficult to diagnose what may be happening here without more information. Can you please share the following information?
diff -r config.example config
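For anyone collecting this output to attach to the issue, here is a minimal sketch that saves the recursive diff to a file. The function name `config_diff` and the default output file `config.diff` are hypothetical and not part of DeepOps. The only real command used is `diff -r`, which exits with 1 when the trees differ, so that exit code is not treated as a failure.

```shell
# Hypothetical helper (not part of DeepOps): save the recursive diff between
# the example config and the local config to a file that can be attached.
config_diff() {
  example_dir="${1:-config.example}"   # reference tree
  config_dir="${2:-config}"            # locally edited tree
  out_file="${3:-config.diff}"         # assumed output file name

  # diff exits 0 when identical, 1 when differences exist, 2 or more on errors.
  rc=0
  diff -r "$example_dir" "$config_dir" > "$out_file" || rc=$?

  if [ "$rc" -ge 2 ]; then
    echo "diff failed with exit code $rc" >&2
    return "$rc"
  fi
  echo "diff written to $out_file"
}
```

Calling `config_diff` with no arguments from the repository root compares `config.example` with `config`, as in the command above.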
This issue is stale because it has been open for 60 days with no activity. Please update the issue or it will be closed in 7 days.
Hi, I am testing DeepOps with one VMware virtual machine as the Slurm master and management node, and one DGX1 as the Slurm compute node. I ran:
ansible-playbook -l slurm-cluster playbooks/slurm-cluster.yml
I am facing various issues: the Slurm build and installation do not happen on either the master or the compute node.
Running
ansible-playbook --tags build -l slurm-cluster playbooks/slurm-cluster.yml
got Slurm built, but not installed. That is actually OK because of the --tags argument. Next I had to run:
ansible-playbook -l slurm-cluster playbooks/slurm-cluster/slurm.yml
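The two-step workaround described here can be sketched as a small wrapper that stops at the first failing step. This is only an illustration: the function name `deploy_slurm_two_step` and the `ANSIBLE_PLAYBOOK` override variable are hypothetical, while the playbook paths and options are the ones quoted in this report.

```shell
# Hypothetical wrapper around the two commands from this report.
# ANSIBLE_PLAYBOOK can be overridden (for example with "echo") to print the
# commands instead of running them.
ANSIBLE_PLAYBOOK="${ANSIBLE_PLAYBOOK:-ansible-playbook}"

deploy_slurm_two_step() {
  # Step 1: run only the tasks tagged "build".
  "$ANSIBLE_PLAYBOOK" --tags build -l slurm-cluster playbooks/slurm-cluster.yml || return 1

  # Step 2: run the Slurm playbook without a tag filter.
  "$ANSIBLE_PLAYBOOK" -l slurm-cluster playbooks/slurm-cluster/slurm.yml || return 1
}
```

Setting `ANSIBLE_PLAYBOOK=echo` before calling the function prints both command lines without touching the cluster.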
The second command installs Slurm, but the deployment is still incomplete or fails at some tasks. Tasks get skipped, and directories are not created even though they do not exist. For example, in the list below, only /etc/slurm exists.