Open yangrudan opened 1 month ago
what is the performance when you set UCX_NET_DEVICES=mlx5_1:1
?
what is the performance when you set
UCX_NET_DEVICES=mlx5_1:1
?
NIC mlx5_0 and NIC mlx5_1 both are the best pcie topo for my xpu. So when set UCX_NET_DEVICES=mlx5_1:1
, it is also fast as below.
can you try to profile it with linux perf and check for the hotspots?
Describe the bug
My environment is the ucx perfest tag_bw test of the GDR in the machine. When I configure the environment variables of one network card, the measured speed is very fast. The environment variables select dual network cards and the speed is super slow. In addition, both dual network cards are optimal pcie topology.
Steps to Reproduce
My commands:
Setup and versions