NVIDIA / deepops

Tools for building GPU clusters
BSD 3-Clause "New" or "Revised" License
1.25k stars 326 forks source link

Cluster creation getting failed #1230

Closed saingithub closed 1 year ago

saingithub commented 2 years ago

I was deployed mater node and it is ready and running.

But the cluster creation step got exciting at the below task TASK [rsyslog-client : configure syslog forwarding] *****

No error also getting in debug mode.

Please help me on this

ajdecon commented 2 years ago

But the cluster creation step got exciting at the below task

While I often find syslog exciting :wink: it's not clear to me what the actual issue you're seeing is. There is no error in your description, and you don't describe how the failure is occurring.

Can you please provide a full log of your Ansible run (preferably in a gist) and describe the problem you're facing?

saingithub commented 2 years ago

After the below task which was completed as ok TASK [rsyslog-client : configure syslog forwarding] **** task path: /root/deepops/roles/rsyslog-client/tasks/main.yml:7

The ansible execution get stopped PLAY RECAP ***** dgx : ok=5 changed=2 unreachable=0 failed=1 skipped=4 rescued=0 ignored=0 dgxmgt : ok=676 changed=49 unreachable=0 failed=0 skipped=1250 rescued=0 ignored=0