openucx / ucx

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
http://www.openucx.org
Other
1.08k stars 412 forks source link

cuda_copy_md.c:489 UCX WARN cuPointerSetAttribute error with CUDA VMM API #9895

Open MinassZhang opened 2 months ago

MinassZhang commented 2 months ago

Describe the bug

Server uses cuda virtual memory management API (cuMemAddressReserve, cuMemCreate..) to create memory region and issues error cuda_copy_md.c:489 UCX WARN cuPointerSetAttribute(0x7f85c0000000, SYNC_MEMOPS) error: operation not supported.

Can RNDV protocl support the memory space allocated by CUDA VMM API?

Steps to Reproduce

Setup and versions

Additional information

yosefe commented 2 months ago

VMM is currently not supported. It should be fixed by #9867 cc @Akshay-Venkatesh