fgci-org / fgci-ansible

:microscope: Collection of the Finnish Grid and Cloud Infrastructure Ansible playbooks
MIT License
54 stars 18 forks source link

Don't run opensm on compute nodes #175

Closed jabl closed 7 years ago

jabl commented 7 years ago

With lots of hosts, opensm can require a non-trivial amount of CPU time. So it shouldn't be run on a compute node where it can disturb batch jobs. Also, with lots of hosts, if all of them run opensm in standby mode I guess there will be a lot of polling on the network to check whether the master is still up.

martbhell commented 7 years ago

OK sounds reasonable :)

However, I think it would be better to keep managing the opensm service but set the service to disabled instead. I'll close this PR and add it manually.

Something like this:

rdma_opensm_enabled: "no"

The task:

- name: Manage the opensm service
  service:
    name: "{{ rdma_opensm_service }}"
    state: "{{ rdma_opensm_state }}"
    enabled: "{{ rdma_opensm_enabled }}"
jabl commented 7 years ago

Thanks, I guess that works too. :)