openucx / ucc

Unified Collective Communication Library
https://openucx.github.io/ucc/
BSD 3-Clause "New" or "Revised" License
177 stars 85 forks source link

TL/CUDA: Linear Broadcast for GPU #948

Open ikryukov opened 3 months ago

ikryukov commented 3 months ago

What

Describe what this PR is doing.

Why ?

Justification for the PR. If there is existing issue/bug please reference. For bug fixes why and what can be merged in a single item.

How ?

It is optional but for complex PRs please provide information about the design, architecture, approach, etc.

ikryukov commented 3 months ago

Configuration string: ./configure --with-ucx=$HPCX_UCX_DIR --with-cuda=/usr/local/cuda --with-mpi=$HPCX_MPI_DIR --enable-gtest --prefix=$PWD/install --with-nvcc-gencode="-gencode=arch=compute_80,code=sm_80" --enable-debug Run string: mpirun --mca coll ^hcoll --mca coll_ucc_enable 0 -x LD_LIBRARY_PATH=/home/ikryukov/work/ucc/install/lib:$LD_LIBRARY_PATH -x UCC_CLS=basic -x UCC_TLS=ucp,cuda -x xUCC_LOG_LEVEL=info -x UCC_TL_CUDA_LOG_LEVEL=debug -x UCC_LOG_LEVEL=info -x UCC_CONFIG_FILE= -np 2 ./install/bin/ucc_test_mpi -c bcast --teams world -M cuda -O 0

swx-jenkins3 commented 3 months ago

Can one of the admins verify this patch?