Closed Echozqn closed 11 months ago
Hello, thank you again for developing the llm-analysis project, it is very helpful to my work. I noticed that the A100 PCIe version uses PCIe 4.0, which has a bidirectional bandwidth of 64 GB/s, so the one-way bandwidth should be 32 GB/s. At the same time, the A100-SXM should use the third generation NVLink, which has a bidirectional speed of 600 GB/s, so the one-way speed should be 300 GB/s.
Against this background, I have a few questions to ask, which I hope will not take up too much of your precious time:
intra_node_bandwidth_in_GB_per_sec
parameter refer to the one-way transmission speed?intra_node_bandwidth_in_GB_per_sec
for A100-pcie-40gb you provided in gpu_config
? Are there possible errors?intra_node_min_message_latency
and inter_node_bandwidth_in_GB_per_sec
?Looking forward to hearing from you, thank you very much for your time and help.
I see, thank you very much for your reply.
Is your feature request related to a problem? Please describe. I recently wanted to test on T4, but I don't know how to measure intra_node information.
Describe the solution you'd like The following is the T4 information I checked, including
intra_node_bandwidth_in_GB_per_sec
intra_node_min_message_latency
inter_node_bandwidth_in_GB_per_sec
. I don’t know how to obtain it.