Open shashank-parsi opened 2 weeks ago
What version of UCX do you use? @nsarka I think we don't check for UCP mem_map param features when building the DPU plugin; can you please check? cc @janjust
I'm cloning UCX as shown below: git clone https://github.com/openucx/ucx.git -b v1.13.x
Thanks, we will work on the fix. Meanwhile you can use v1.15 or newer
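For anyone who has to stay on an older UCX for now, one possible way to keep the build from breaking is a compile-time guard on the UCP API version. The following is only a minimal sketch and not the change made in the PR. It assumes the `UCP_VERSION` / `UCP_API_VERSION` macros from `<ucp/api/ucp_version.h>` and takes v1.15 as the threshold, based on the advice above. The stand-in macro definitions, `TL_UCP_HAVE_EXPORTED_MEMH`, and `sliding_window_register_supported` are hypothetical names used for illustration so that the snippet compiles without UCX installed.

```c
/* Sketch only: a real build would include <ucp/api/ucp_version.h> instead of
 * the stand-in definitions below. The stand-ins simulate a UCX v1.13 install
 * so that this example compiles on a machine without UCX. */
#ifndef UCP_VERSION
#define UCP_VERSION(major, minor) (((major) << 24) | ((minor) << 16))
#endif
#ifndef UCP_API_VERSION
#define UCP_API_VERSION UCP_VERSION(1, 13)
#endif

/* Assumption (from the advice in this thread): the exported-memh fields of
 * ucp_mem_map_params_t can be relied on from UCX v1.15 onward. */
#if UCP_API_VERSION >= UCP_VERSION(1, 15)
#define TL_UCP_HAVE_EXPORTED_MEMH 1
#else
#define TL_UCP_HAVE_EXPORTED_MEMH 0
#endif

/* Hypothetical helper: returns 0 when the exported-memh path is compiled in,
 * -1 when the installed UCX is too old to provide it. */
int sliding_window_register_supported(void)
{
#if TL_UCP_HAVE_EXPORTED_MEMH
    /* The real code would fill in the map parameters here, e.g.
     *   params.field_mask           = UCP_MEM_MAP_PARAM_FIELD_EXPORTED_MEMH_BUFFER;
     *   params.exported_memh_buffer = packed_memh;
     */
    return 0;
#else
    /* Older UCX: report "not supported" instead of failing to compile. */
    return -1;
#endif
}
```

With the simulated v1.13 headers the guard evaluates to 0 and the helper returns -1, so the unsupported fields are never referenced. A configure-time check for the declaration would be an alternative that avoids hard-coding a version number.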
PR open to fix this issue here: https://github.com/openucx/ucc/pull/1032
Hello @nsarka, may I know when this PR will be merged?
Hello all, I see a compilation issue with UCC on the master branch.
steps followed:
issue seen:
make[3]: Entering directory '/home/master/rastra/rocm_tests/hipmpi/ucc/src/components/tl/ucp'
  CC       libucc_tl_ucp_la-tl_ucp_dpu_offload.lo
tl_ucp_dpu_offload.c: In function ‘ucc_tl_ucp_allreduce_sliding_window_register’:
tl_ucp_dpu_offload.c:18:35: error: ‘UCP_MEM_MAP_PARAM_FIELD_EXPORTED_MEMH_BUFFER’ undeclared (first use in this function)
   18 |     params.field_mask = UCP_MEM_MAP_PARAM_FIELD_EXPORTED_MEMH_BUFFER;
      |                         ^~~~~~~~~~~
tl_ucp_dpu_offload.c:18:35: note: each undeclared identifier is reported only once for each function it appears in
tl_ucp_dpu_offload.c:19:11: error: ‘ucp_mem_map_params_t’ {aka ‘struct ucp_mem_map_params’} has no member named ‘exported_memh_buffer’
   19 |     params.exported_memh_buffer = packed_memh;
      |           ^
make[3]: [Makefile:1242: libucc_tl_ucp_la-tl_ucp_dpu_offload.lo] Error 1
make[3]: Leaving directory '/home/master/rastra/rocm_tests/hipmpi/ucc/src/components/tl/ucp'
make[2]: [Makefile:1592: install-recursive] Error 1
make[2]: Leaving directory '/home/master/rastra/rocm_tests/hipmpi/ucc/src/components/tl/ucp'
make[1]: [Makefile:1409: install-recursive] Error 1
make[1]: Leaving directory '/home/master/rastra/rocm_tests/hipmpi/ucc/src'
make: [Makefile:576: install-recursive] Error 1

NOTE: the issue is not seen with branch v1.3.x
Test environment: