I have 2 GPUs and they are under the same PCIe Switch. When I enable P2P access and allocate memory blocks on device1 and device0, I can access device1's memory block in kernels launched on device 0 within current process.
I wander whether a cuda IPC memory handle, allocated on device1 from another process, can somehow be directly accessed in kernel launched on device0 within current process.
I have 2 GPUs and they are under the same PCIe Switch. When I enable P2P access and allocate memory blocks on device1 and device0, I can access device1's memory block in kernels launched on device 0 within current process. I wander whether a cuda IPC memory handle, allocated on device1 from another process, can somehow be directly accessed in kernel launched on device0 within current process.