Open leewxgit opened 9 months ago
Hey, did you find a solution to your problem?
Hi, I think the answer is that the RTX 3090 does not support P2P communication. And because it is a hardware limitation, there is no way to enable P2P on the RTX 3090.
This is very weird. I have seen a lot of setups where the 3090 supports P2P, for example here: https://forums.developer.nvidia.com/t/parallel-training-with-4-cards-4090-cannot-be-performed-on-amd-5975wx-stuck-at-the-beginning/237813/10
It seems that only the 4090 does not support it.
Thanks for the information. It is certain that the 4090 does not support it. I am not sure whether the 3090 does, which is why I raised this issue. In my case, I cannot find any way to enable P2P on the 3090.
I am almost sure that the 3090 supports it, though I am also having trouble making it work 😅
😂
Okay, I think I found a way to fix the problem:
I downgraded from driver 545 to 535 on my Ubuntu machine, and now I don't have the issue anymore.
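For anyone comparing driver versions before trying the same downgrade, here is a minimal sketch that reads the installed driver version through `nvidia-smi`. The helper names `driver_major` and `installed_driver_versions` are made up for illustration, and the only evidence that the 545 branch is the culprit is the report above, so treat the result as a hint rather than a diagnosis.

```python
import subprocess
from typing import List


def driver_major(version: str) -> int:
    """Return the major branch of a driver version string such as '535.x.y'."""
    return int(version.strip().split(".")[0])


def installed_driver_versions() -> List[str]:
    """Ask nvidia-smi for the driver version reported by each GPU.

    Requires nvidia-smi on PATH; raises CalledProcessError if it fails.
    """
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return [line.strip() for line in out.splitlines() if line.strip()]
```

On a machine with the NVIDIA driver installed, `[driver_major(v) for v in installed_driver_versions()]` gives one major version per GPU, which can be compared against 535 and 545.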
Excellent! I think I already tried this type of method before, but I would like to try it again. 😂
Hello, I am testing the training performance of various DNN models on a single server with 8 RTX 3090 GPUs. The GPUs are interconnected via PCIe 3.0 x16. During my training experiments, I noticed significant communication overhead. After investigating, I suspect the reason is that P2P communication between the GPUs cannot be enabled.
I am wondering: Does the RTX 3090 support P2P communication? Can you provide a definitive answer? I attempted to search online, and it seems that the RTX 4090 does not support P2P. However, I’m uncertain about the RTX 3090.
If the 3090 does support P2P, how can I enable it?
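As a quick way to see what the driver itself reports, here is a minimal sketch that builds a peer-access matrix. It assumes PyTorch with CUDA is installed and uses `torch.cuda.can_device_access_peer`; the helpers `p2p_matrix` and `format_matrix` are hypothetical names introduced for this example. The `can_access` parameter exists only so the logic can be tried without a GPU.

```python
from typing import Callable, List, Optional


def p2p_matrix(
    num_gpus: int,
    can_access: Optional[Callable[[int, int], bool]] = None,
) -> List[List[bool]]:
    """Return matrix[i][j] = True if GPU i reports peer access to GPU j."""
    if can_access is None:
        # Imported lazily so the helper can be exercised without a GPU.
        import torch

        can_access = torch.cuda.can_device_access_peer
    return [
        [i != j and bool(can_access(i, j)) for j in range(num_gpus)]
        for i in range(num_gpus)
    ]


def format_matrix(matrix: List[List[bool]]) -> str:
    """Render the matrix as rows of 1/0, with '-' on the diagonal."""
    rows = []
    for i, row in enumerate(matrix):
        cells = ["-" if i == j else ("1" if ok else "0") for j, ok in enumerate(row)]
        rows.append(" ".join(cells))
    return "\n".join(rows)
```

On the server itself, something like `print(format_matrix(p2p_matrix(torch.cuda.device_count())))` should print an 8x8 grid. A grid of all zeros would suggest the driver is refusing peer access outright, rather than the training framework failing to use it.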
The output of `./p2pBandwidthLatencyTest`:

I have disabled all ACSCtl. The output of `sudo lspci -vvv | grep ACSCtl`:
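To double-check that ACS really is off everywhere, here is a small sketch that scans saved `lspci` output for any ACSCtl flag still marked `+`. It assumes the usual `lspci -vvv` layout, where each flag is followed by `+` (enabled) or `-` (disabled); the function names are hypothetical.

```python
import re
from typing import List


def enabled_acs_flags(lspci_text: str) -> List[List[str]]:
    """For each ACSCtl line, list the flags marked '+' (enabled)."""
    result = []
    for line in lspci_text.splitlines():
        _, sep, flags = line.partition("ACSCtl:")
        if not sep:
            continue  # not an ACSCtl line
        result.append(re.findall(r"(\w+)\+", flags))
    return result


def acs_fully_disabled(lspci_text: str) -> bool:
    """True only if at least one ACSCtl line exists and none has a '+' flag."""
    per_line = enabled_acs_flags(lspci_text)
    return bool(per_line) and all(not flags for flags in per_line)
```

The text can come from `sudo lspci -vvv | grep ACSCtl > acs.txt`, then `acs_fully_disabled(open("acs.txt").read())`. Any non-empty entry from `enabled_acs_flags` points at a bridge where ACS is still active.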