linux-rdma / opensm

opensm high load issue #29

Open wenngong opened 2 years ago

wenngong commented 2 years ago

We deployed a large HPC cluster with more than 1k nodes in the same subnet. When one switch went down, we saw high load on opensm. After capturing traffic with ibdump on the opensm master node, we found a large volume of path record requests, which may be the cause of the high load.

My questions are:

  1. What scenarios trigger a path record request?
  2. Does each ARP request cause one path record request?