NVIDIA / deepops

Tools for building GPU clusters
BSD 3-Clause "New" or "Revised" License
1.25k stars 326 forks source link

Error Running ansible-playbook on slurm-cluster: Docker-ce Repository Activation Issue #1302

Closed sikso1892 closed 8 months ago

sikso1892 commented 11 months ago

Environment:

Issue Description:

I am experiencing an issue with setting up a Slurm cluster using the DeepOps project. The error occurs when executing the ansible-playbook command as follows:

ansible-playbook -l slurm-cluster playbooks/slurm-cluster.yml

Error Message:

The task to enable the Docker CE repository fails with the following error:

TASK [../../submodules/kubespray/roles/container-engine/docker : ensure docker-ce repository is enabled] ********************************************************************************************************
failed: [novo-18] (item=deb [arch=amd64] https://download.docker.com/linux/ubuntu bionic stable
) => changed=false
  ansible_loop_var: item
  item: |-
    deb [arch=amd64] https://download.docker.com/linux/ubuntu bionic stable
  msg: 'Error: Module unable to decode valid JSON on stdin.  Unable to figure out what parameters were passed'

PLAY RECAP ******************************************************************************************************************************************************************************************************
novo-18                    : ok=62   changed=1    unreachable=0    failed=1    skipped=46   rescued=0    ignored=1
novo-200                   : ok=23   changed=1    unreachable=0    failed=0    skipped=9    rescued=0    ignored=1
github-actions[bot] commented 9 months ago

This issue is stale because it has been open for 60 days with no activity. Please update the issue or it will be closed in 7 days.