Closed s769 closed 2 months ago
Note ia=3 ja=3 ib=3 jb=1
you have in your parameters. Those specify the first row/column of A/B that will be used (see the documentation). That's why you get nan
- the memory in the first two rows was never touched. You will not see those two rows if you specify ia=1 ja=1 ib=1 jb=1
.
As for the hang with 3 processes, please see this comment - p*q
has to be equal to the number of processes you are using (and equal to the number of GPUs). Are you running -np 3
with p=2 q=1
?
Thank you. I changed the parameters you mentioned and now the example with 2 GPUs also works
c318-004.ls6(1011)$ mpiexec -np 2 mp_potrf_potrs -verbose 1 -ia 1 -ib 1 -ja 1 -jb 1
Parameters: m=1 n=10 nrhs=1 mbA=2 nbA=2 mbB=2 nbB=2 mbQ=2 nbQ=2 mbZ=0 nbZ=0ia=1 ja=1 ib=1 jb=1 iq=1 jq=1 iz=0 jz=0 p=2 q=1 grid_layout= verbose=1
Parameters: m=1 n=10 nrhs=1 mbA=2 nbA=2 mbB=2 nbB=2 mbQ=2 nbQ=2 mbZ=0 nbZ=0ia=1 ja=1 ib=1 jb=1 iq=1 jq=1 iz=0 jz=0 p=2 q=1 grid_layout= verbose=1
[1719257178.325559] [c318-004:275852:0] ucc_mc.c:78 UCC WARN mc cpu mc was allready initilized with different thread mode: current tm 0, provided tm 2
[1719257178.325572] [c318-004:275852:0] ucc_mc.c:78 UCC WARN mc cuda mc was allready initilized with different thread mode: current tm 0, provided tm 2
[1719257178.325576] [c318-004:275852:0] ucc_ec.c:75 UCC WARN ec cpu ec was allready initilized with different thread mode: current tm 0, provided tm 2
[1719257178.325580] [c318-004:275852:0] ucc_ec.c:75 UCC WARN ec cuda ec was allready initilized with different thread mode: current tm 0, provided tm 2
[1719257178.325559] [c318-004:275853:0] ucc_mc.c:78 UCC WARN mc cpu mc was allready initilized with different thread mode: current tm 0, provided tm 2
[1719257178.325573] [c318-004:275853:0] ucc_mc.c:78 UCC WARN mc cuda mc was allready initilized with different thread mode: current tm 0, provided tm 2
[1719257178.325576] [c318-004:275853:0] ucc_ec.c:75 UCC WARN ec cpu ec was allready initilized with different thread mode: current tm 0, provided tm 2
[1719257178.325581] [c318-004:275853:0] ucc_ec.c:75 UCC WARN ec cuda ec was allready initilized with different thread mode: current tm 0, provided tm 2
[c318-004][Jun 24 14:26:19 503804][RDMA_SR][275852][error] - Error event recieved: event: RDMA_CM_EVENT_ROUTE_ERROR, error: -22
[c318-004][Jun 24 14:26:19 503874][RDMA_SR][275852][error] - Error occured during connection event handle
[c318-004][Jun 24 14:26:22 506892][RDMA_SR][275852][error] - poll failed due to poll_timeout=3000.000000, stop
[c318-004][Jun 24 14:26:22 506922][RDMA_SR][275852][error] - Poll failed
[c318-004][Jun 24 14:26:22 506928][RDMA_SR][275852][error] - Failed to connect
[c318-004][Jun 24 14:26:22 506983][RDMA_SR][275852][error] - rdma_resolve_addr failed with error: -1
[c318-004][Jun 24 14:26:22 506994][RDMA_SR][275852][error] - rdma_resolve_addr failed with error: -1
[c318-004][Jun 24 14:26:22 507005][GENERAL][275852][error] - unable to query AM service record(AM query)
[c318-004][Jun 24 14:26:23 738810][RDMA_SR][275852][error] - Error event recieved: event: RDMA_CM_EVENT_ROUTE_ERROR, error: -22
[c318-004][Jun 24 14:26:23 738838][RDMA_SR][275852][error] - Error occured during connection event handle
[c318-004][Jun 24 14:26:26 741854][RDMA_SR][275852][error] - poll failed due to poll_timeout=3000.000000, stop
[c318-004][Jun 24 14:26:26 741883][RDMA_SR][275852][error] - Poll failed
[c318-004][Jun 24 14:26:26 741890][RDMA_SR][275852][error] - Failed to connect
[c318-004][Jun 24 14:26:26 741939][RDMA_SR][275852][error] - rdma_resolve_addr failed with error: -1
[c318-004][Jun 24 14:26:26 741950][RDMA_SR][275852][error] - rdma_resolve_addr failed with error: -1
[c318-004][Jun 24 14:26:26 741956][GENERAL][275852][error] - unable to query AM service record(AM query)
print_host_matrix : Input matrix A
11.72 0.39 0.78 0.80 0.91 0.20 0.34 0.77 0.28 0.55
0.39 10.69 0.63 0.36 0.51 0.95 0.92 0.64 0.72 0.14
0.78 0.63 4.99 0.02 0.24 0.14 0.80 0.16 0.40 0.13
0.80 0.36 0.02 7.17 1.00 0.22 0.51 0.84 0.61 0.30
0.91 0.51 0.24 1.00 7.38 0.52 0.49 0.97 0.29 0.77
0.20 0.95 0.14 0.22 0.52 5.74 0.77 0.40 0.89 0.28
0.34 0.92 0.80 0.51 0.49 0.77 4.30 0.81 0.92 0.07
0.77 0.64 0.16 0.84 0.97 0.40 0.81 3.12 0.53 0.09
0.28 0.72 0.40 0.61 0.29 0.89 0.92 0.53 1.71 0.66
0.55 0.14 0.13 0.30 0.77 0.28 0.07 0.09 0.66 1.78
print_host_matrix : Input matrix X
-nan
-nan
-nan
-nan
-nan
-nan
-nan
-nan
-nan
-nan
print_host_matrix : Input matrix B
1.00
1.00
1.00
1.00
1.00
1.00
1.00
1.00
1.00
1.00
print_host_matrix : Output matrix X
0.03
0.04
0.14
0.07
0.03
0.08
0.09
0.20
0.19
0.42
|b - A*x|_inf = 2.220446E-16
|x|_inf = 1.273718E+00
|b|_inf = 1.000000E+01
|A|_inf = 1.674133E+01
|b - A*x|/(|A|*|x|+|b|) = 7.088702E-18
Setting p
and q
correctly should also work (I was using incompatible values before). I still see the error messages in the middle though (even if the output is correct). Do you know how to address those?
I changed the parameters you mentioned and now the example with 2 GPUs also works
To clarify, it worked before. 🙂 It's a feature, not a bug - ia
, ja
allows you to perform operations on a submatrix if needed. ScaLAPACK explains it well: https://www.netlib.org/scalapack/slug/node79.html.
As for the RDMA error, I am not sure. Probably UCC tries to initialize a backend that doesn't work on your system. UCC tries to initialize what it can, and if it fails it falls back to a working backend. This would explain why the application works correctly.
My suggestions that may help narrow it down are:
export UCC_TLS=^mlx5,sharp
to check if it's coming from one of those layers.export UCC_LOG_LEVEL=DEBUG
for extra UCC debugging output to see if that gives us any clues.Here is the output with those environment variables set.
c318-001.ls6(1007)$ mpiexec -np 2 mp_potrf_potrs -verbose 1 -ia 1 -ib 1 -ja 1 -jb 1
Parameters: m=1 n=10 nrhs=1 mbA=2 nbA=2 mbB=2 nbB=2 mbQ=2 nbQ=2 mbZ=0 nbZ=0ia=1 ja=1 ib=1 jb=1 iq=1 jq=1 iz=0 jz=0 p=2 q=1 grid_layout= verbose=1
Parameters: m=1 n=10 nrhs=1 mbA=2 nbA=2 mbB=2 nbB=2 mbQ=2 nbQ=2 mbZ=0 nbZ=0ia=1 ja=1 ib=1 jb=1 iq=1 jq=1 iz=0 jz=0 p=2 q=1 grid_layout= verbose=1
[1719264274.005250] [c318-001:1866940:0] ucc_proc_info.c:311 UCC DEBUG proc pid 1866940, host c318-001, host_hash 2154568351744862994, sockid 0, numaid 0
[1719264274.005272] [c318-001:1866940:0] ucc_constructor.c:191 UCC INFO version: 1.3.0, loaded from: /scratch/projects/compilers/nvhpc_24.5/Linux_x86_64/24.5/comm_libs/12.4/hpcx/hpcx-2.19/ucc/lib/libucc.so.1, cfg file: /scratch/projects/compilers/nvhpc_24.5/Linux_x86_64/24.5/comm_libs/12.4/hpcx/hpcx-2.19/ucc/share/ucc.conf
[1719264274.005299] [c318-001:1866940:0] ucc_mc.c:67 UCC DEBUG mc cpu mc initialized
[1719264274.005354] [c318-001:1866939:0] ucc_proc_info.c:311 UCC DEBUG proc pid 1866939, host c318-001, host_hash 2154568351744862994, sockid 0, numaid 0
[1719264274.005369] [c318-001:1866939:0] ucc_constructor.c:191 UCC INFO version: 1.3.0, loaded from: /scratch/projects/compilers/nvhpc_24.5/Linux_x86_64/24.5/comm_libs/12.4/hpcx/hpcx-2.19/ucc/lib/libucc.so.1, cfg file: /scratch/projects/compilers/nvhpc_24.5/Linux_x86_64/24.5/comm_libs/12.4/hpcx/hpcx-2.19/ucc/share/ucc.conf
[1719264274.005392] [c318-001:1866939:0] ucc_mc.c:67 UCC DEBUG mc cpu mc initialized
[1719264274.015114] [c318-001:1866940:0] mc_cuda.c:65 cuda mc DEBUG driver version 12020
[1719264274.015126] [c318-001:1866940:0] mc_cuda.c:78 cuda mc DEBUG cuCtxGetDevice() failed: invalid device context
[1719264274.015133] [c318-001:1866940:0] ucc_mc.c:67 UCC DEBUG mc cuda mc initialized
[1719264274.015149] [c318-001:1866940:0] ucc_ec.c:63 UCC DEBUG ec cpu ec initialized
[1719264274.015610] [c318-001:1866939:0] mc_cuda.c:65 cuda mc DEBUG driver version 12020
[1719264274.015618] [c318-001:1866939:0] mc_cuda.c:78 cuda mc DEBUG cuCtxGetDevice() failed: invalid device context
[1719264274.015622] [c318-001:1866939:0] ucc_mc.c:67 UCC DEBUG mc cuda mc initialized
[1719264274.015635] [c318-001:1866939:0] ucc_ec.c:63 UCC DEBUG ec cpu ec initialized
[1719264274.019374] [c318-001:1866940:0] ucc_ec.c:63 UCC DEBUG ec cuda ec initialized
[1719264274.019411] [c318-001:1866940:0] cl_basic_lib.c:20 CL_BASIC DEBUG initialized lib object: 0x13448b0
[1719264274.019423] [c318-001:1866940:0] ucc_lib.c:152 UCC DEBUG lib_prefix "OMPI_UCC_": initialized component "basic" score 10
[1719264274.019453] [c318-001:1866940:0] cl_hier_lib.c:53 CL_HIER DEBUG initialized lib object: 0xfe2ac0
[1719264274.019458] [c318-001:1866940:0] ucc_lib.c:152 UCC DEBUG lib_prefix "OMPI_UCC_": initialized component "hier" score 50
[1719264274.019478] [c318-001:1866940:0] tl_cuda_lib.c:35 TL_CUDA DEBUG initialized lib object: 0x12f9450
[1719264274.019491] [c318-001:1866940:0] tl_self_lib.c:20 TL_SELF DEBUG initialized lib object: 0x12fbd90
[1719264274.019550] [c318-001:1866940:0] tl_ucp_lib.c:69 TL_UCP DEBUG initialized lib object: 0xd23e20
[1719264274.019594] [c318-001:1866940:0] tl_cuda_context.c:43 TL_CUDA DEBUG cannot create CUDA TL context without active CUDA context
[1719264274.019601] [c318-001:1866940:0] ucc_context.c:412 UCC DEBUG failed to create tl context for cuda
[1719264274.020090] [c318-001:1866939:0] ucc_ec.c:63 UCC DEBUG ec cuda ec initialized
[1719264274.020125] [c318-001:1866939:0] cl_basic_lib.c:20 CL_BASIC DEBUG initialized lib object: 0x2b46040
[1719264274.020133] [c318-001:1866939:0] ucc_lib.c:152 UCC DEBUG lib_prefix "OMPI_UCC_": initialized component "basic" score 10
[1719264274.020161] [c318-001:1866939:0] cl_hier_lib.c:53 CL_HIER DEBUG initialized lib object: 0x27d5c20
[1719264274.020165] [c318-001:1866939:0] ucc_lib.c:152 UCC DEBUG lib_prefix "OMPI_UCC_": initialized component "hier" score 50
[1719264274.020182] [c318-001:1866939:0] tl_cuda_lib.c:35 TL_CUDA DEBUG initialized lib object: 0x2ad4c30
[1719264274.020194] [c318-001:1866939:0] tl_self_lib.c:20 TL_SELF DEBUG initialized lib object: 0x2b20130
[1719264274.020249] [c318-001:1866939:0] tl_ucp_lib.c:69 TL_UCP DEBUG initialized lib object: 0x2516e20
[1719264274.020293] [c318-001:1866939:0] tl_cuda_context.c:43 TL_CUDA DEBUG cannot create CUDA TL context without active CUDA context
[1719264274.020299] [c318-001:1866939:0] ucc_context.c:412 UCC DEBUG failed to create tl context for cuda
[1719264274.058242] [c318-001:1866939:0] tl_ucp_context.c:277 TL_UCP DEBUG initialized tl context: 0x2b0ebb0
[1719264274.058273] [c318-001:1866939:0] cl_basic_context.c:39 CL_BASIC DEBUG TL cuda context is not available, skipping
[1719264274.058277] [c318-001:1866939:0] cl_basic_context.c:50 CL_BASIC DEBUG initialized cl context: 0x2b512c0
[1719264274.058287] [c318-001:1866939:0] cl_hier_context.c:43 CL_HIER DEBUG TL cuda context is not available, skipping
[1719264274.058291] [c318-001:1866939:0] cl_hier_context.c:64 CL_HIER DEBUG initialized cl context: 0x2b4aad0
[1719264274.067381] [c318-001:1866940:0] tl_ucp_context.c:277 TL_UCP DEBUG initialized tl context: 0x12f9850
[1719264274.067400] [c318-001:1866940:0] cl_basic_context.c:39 CL_BASIC DEBUG TL cuda context is not available, skipping
[1719264274.067404] [c318-001:1866940:0] cl_basic_context.c:50 CL_BASIC DEBUG initialized cl context: 0x1354f80
[1719264274.067412] [c318-001:1866940:0] cl_hier_context.c:43 CL_HIER DEBUG TL cuda context is not available, skipping
[1719264274.067416] [c318-001:1866940:0] cl_hier_context.c:64 CL_HIER DEBUG initialized cl context: 0x1349230
[1719264274.068571] [c318-001:1866939:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x2e107b0
[1719264274.068577] [c318-001:1866939:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x2e107b0
[1719264274.068580] [c318-001:1866939:0] ucc_context.c:839 UCC DEBUG created ucc context 0x2b47030 for lib OMPI_UCC_
[1719264274.068582] [c318-001:1866940:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x161c7a0
[1719264274.068588] [c318-001:1866940:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x161c7a0
[1719264274.068591] [c318-001:1866940:0] ucc_context.c:839 UCC DEBUG created ucc context 0x13458a0 for lib OMPI_UCC_
[1719264274.613824] [c318-001:1866940:0] ucc_team.c:370 UCC DEBUG team 0x1ab7610 rank 1, ctx_rank 1, map_type 1
[1719264274.613825] [c318-001:1866939:0] ucc_team.c:370 UCC DEBUG team 0x32a6230 rank 0, ctx_rank 0, map_type 1
[1719264274.613851] [c318-001:1866939:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264274.613859] [c318-001:1866939:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x3203d60
[1719264274.613851] [c318-001:1866940:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264274.613859] [c318-001:1866940:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x16c91f0
[1719264274.613881] [c318-001:1866939:0] tl_shm_team.c:158 TL_SHM DEBUG using perf params: generic
[1719264274.622135] [c318-001:1866939:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264274.622147] [c318-001:1866939:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264274.622149] [c318-001:1866939:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x32b1de0
[1719264274.622151] [c318-001:1866939:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x32b1de0
[1719264274.622135] [c318-001:1866940:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264274.622147] [c318-001:1866940:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264274.622149] [c318-001:1866940:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x1ad0fc0
[1719264274.622151] [c318-001:1866940:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x1ad0fc0
[1719264274.622155] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264274.622158] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264274.622160] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264274.622155] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264274.622158] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264274.622160] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264274.622182] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264274.622184] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264274.622186] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264274.622182] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264274.622184] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264274.622185] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264274.622258] [c318-001:1866940:0] cl_hier_team.c:58 CL_HIER DEBUG skipping single node team
[1719264274.622265] [c318-001:1866940:0] ucc_team.c:297 UCC DEBUG failed to create CL hier team
[1719264274.622258] [c318-001:1866939:0] cl_hier_team.c:58 CL_HIER DEBUG skipping single node team
[1719264274.622265] [c318-001:1866939:0] ucc_team.c:297 UCC DEBUG failed to create CL hier team
[1719264274.622275] [c318-001:1866939:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32768, size 2) =====
[1719264274.622285] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264274.622285] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264274.622285] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264274.622285] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264274.622296] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264274.622296] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_SHM:10 {4K..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264274.622296] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264274.622296] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264274.622305] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264274.622305] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..257}:TL_UCP:10 {258..inf}:TL_UCP:10
[1719264274.622305] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622305] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622319] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264274.622319] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.622319] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622319] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622332] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264274.622332] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264274.622332] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622332] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622345] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264274.622345] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264274.622345] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622345] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622361] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264274.622361] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264274.622361] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622361] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622370] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264274.622370] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264274.622370] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622370] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622382] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264274.622382] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.622382] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622382] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622393] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264274.622393] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.622393] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622393] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622405] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264274.622405] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264274.622405] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622405] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622416] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264274.622416] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.622416] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622416] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622428] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264274.622428] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.622428] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622428] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622435] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264274.622435] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.622435] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.622435] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.622444] [c318-001:1866939:0] ucc_team.c:474 UCC INFO ================================================
[1719264274.631501] [c318-001:1866939:0] ucc_team.c:370 UCC DEBUG team 0x33166e0 rank 0, ctx_rank 0, map_type 4
[1719264274.631511] [c318-001:1866939:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264274.631514] [c318-001:1866939:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x2e429b0
[1719264274.631522] [c318-001:1866939:0] tl_shm_team.c:158 TL_SHM DEBUG using perf params: generic
[1719264274.631501] [c318-001:1866940:0] ucc_team.c:370 UCC DEBUG team 0x1b26830 rank 1, ctx_rank 1, map_type 4
[1719264274.631511] [c318-001:1866940:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264274.631513] [c318-001:1866940:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x17ae010
[1719264274.644294] [c318-001:1866940:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264274.644298] [c318-001:1866940:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264274.644300] [c318-001:1866940:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x1b26de0
[1719264274.644301] [c318-001:1866940:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x1b26de0
[1719264274.644304] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264274.644306] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264274.644308] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264274.644318] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264274.644320] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264274.644322] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264274.644363] [c318-001:1866940:0] cl_hier_team.c:58 CL_HIER DEBUG skipping single node team
[1719264274.644366] [c318-001:1866940:0] ucc_team.c:297 UCC DEBUG failed to create CL hier team
[1719264274.644451] [c318-001:1866940:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0xe62990
[1719264274.644463] [c318-001:1866940:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x1b26de0
[1719264274.644474] [c318-001:1866940:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x17ae010
[1719264274.644536] [c318-001:1866939:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264274.644540] [c318-001:1866939:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264274.644542] [c318-001:1866939:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x3317080
[1719264274.644545] [c318-001:1866939:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x3317080
[1719264274.644548] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264274.644550] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264274.644553] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264274.644559] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264274.644562] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264274.644565] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264274.644599] [c318-001:1866939:0] cl_hier_team.c:58 CL_HIER DEBUG skipping single node team
[1719264274.644602] [c318-001:1866939:0] ucc_team.c:297 UCC DEBUG failed to create CL hier team
[1719264274.644609] [c318-001:1866939:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32771, size 2) =====
[1719264274.644616] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264274.644616] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264274.644616] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264274.644616] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264274.644629] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264274.644629] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_SHM:10 {4K..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264274.644629] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264274.644629] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264274.644643] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264274.644643] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..257}:TL_UCP:10 {258..inf}:TL_UCP:10
[1719264274.644643] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644643] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644652] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264274.644652] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.644652] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644652] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644659] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264274.644659] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264274.644659] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644659] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644670] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264274.644670] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264274.644670] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644670] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644676] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264274.644676] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264274.644676] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644676] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644686] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264274.644686] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264274.644686] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644686] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644697] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264274.644697] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.644697] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644697] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644705] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264274.644705] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.644705] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644705] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644718] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264274.644718] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264274.644718] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644718] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644726] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264274.644726] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.644726] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644726] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644736] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264274.644736] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.644736] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644736] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644747] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264274.644747] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264274.644747] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264274.644747] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264274.644759] [c318-001:1866939:0] ucc_team.c:474 UCC INFO ================================================
[1719264274.644838] [c318-001:1866939:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0x2655990
[1719264274.644847] [c318-001:1866939:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x3317080
[1719264274.644855] [c318-001:1866939:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x2e429b0
[1719264274.644859] [c318-001:1866939:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32771
[1719264275.064913] [c318-001:1866939:0] ucc_mc.c:78 UCC WARN mc cpu mc was allready initilized with different thread mode: current tm 0, provided tm 2
[1719264275.064927] [c318-001:1866939:0] ucc_mc.c:78 UCC WARN mc cuda mc was allready initilized with different thread mode: current tm 0, provided tm 2
[1719264275.064930] [c318-001:1866939:0] ucc_ec.c:75 UCC WARN ec cpu ec was allready initilized with different thread mode: current tm 0, provided tm 2
[1719264275.064933] [c318-001:1866939:0] ucc_ec.c:75 UCC WARN ec cuda ec was allready initilized with different thread mode: current tm 0, provided tm 2
[1719264275.064948] [c318-001:1866939:0] cl_basic_lib.c:20 CL_BASIC DEBUG initialized lib object: 0x37479f0
[1719264275.064914] [c318-001:1866940:0] ucc_mc.c:78 UCC WARN mc cpu mc was allready initilized with different thread mode: current tm 0, provided tm 2
[1719264275.064927] [c318-001:1866940:0] ucc_mc.c:78 UCC WARN mc cuda mc was allready initilized with different thread mode: current tm 0, provided tm 2
[1719264275.064931] [c318-001:1866940:0] ucc_ec.c:75 UCC WARN ec cpu ec was allready initilized with different thread mode: current tm 0, provided tm 2
[1719264275.064933] [c318-001:1866940:0] ucc_ec.c:75 UCC WARN ec cuda ec was allready initilized with different thread mode: current tm 0, provided tm 2
[1719264275.064949] [c318-001:1866940:0] cl_basic_lib.c:20 CL_BASIC DEBUG initialized lib object: 0x1f57f10
[1719264275.064956] [c318-001:1866940:0] ucc_lib.c:152 UCC DEBUG lib_prefix "CAL_UCC_": initialized component "basic" score 10
[1719264275.064956] [c318-001:1866939:0] ucc_lib.c:152 UCC DEBUG lib_prefix "CAL_UCC_": initialized component "basic" score 10
[1719264275.064971] [c318-001:1866939:0] tl_cuda_lib.c:35 TL_CUDA DEBUG initialized lib object: 0x3c9eed0
[1719264275.064987] [c318-001:1866939:0] tl_nccl_lib.c:16 TL_NCCL DEBUG initialized lib object: 0x3f5c3f0
[1719264275.064971] [c318-001:1866940:0] tl_cuda_lib.c:35 TL_CUDA DEBUG initialized lib object: 0x24afb40
[1719264275.064987] [c318-001:1866940:0] tl_nccl_lib.c:16 TL_NCCL DEBUG initialized lib object: 0x2739f40
[1719264275.064995] [c318-001:1866940:0] tl_self_lib.c:20 TL_SELF DEBUG initialized lib object: 0x24999d0
[1719264275.064995] [c318-001:1866939:0] tl_self_lib.c:20 TL_SELF DEBUG initialized lib object: 0x3c89030
[1719264275.065036] [c318-001:1866940:0] tl_ucp_lib.c:69 TL_UCP DEBUG initialized lib object: 0x27661c0
[1719264275.065036] [c318-001:1866939:0] tl_ucp_lib.c:69 TL_UCP DEBUG initialized lib object: 0x3f28e70
[1719264275.065066] [c318-001:1866939:0] ucc_context.c:247 UCC INFO required TL sharp is not part of the context
[1719264275.065070] [c318-001:1866939:0] ucc_context.c:247 UCC INFO required TL hcoll is not part of the context
[1719264275.065066] [c318-001:1866940:0] ucc_context.c:247 UCC INFO required TL sharp is not part of the context
[1719264275.065070] [c318-001:1866940:0] ucc_context.c:247 UCC INFO required TL hcoll is not part of the context
[1719264275.095681] [c318-001:1866940:0] tl_cuda_context.c:71 TL_CUDA DEBUG initialized tl context: 0x2152ef0
[1719264275.095698] [c318-001:1866940:0] tl_nccl_context.c:182 TL_NCCL DEBUG using memops completion sync
[1719264275.098553] [c318-001:1866940:0] tl_nccl_context.c:205 TL_NCCL DEBUG initialized tl context: 0x1f46270
[1719264275.098818] [c318-001:1866939:0] tl_cuda_context.c:71 TL_CUDA DEBUG initialized tl context: 0x39429d0
[1719264275.098832] [c318-001:1866939:0] tl_nccl_context.c:182 TL_NCCL DEBUG using memops completion sync
[1719264275.098948] [c318-001:1866939:0] tl_nccl_context.c:205 TL_NCCL DEBUG initialized tl context: 0x3736120
[1719264275.135466] [c318-001:1866939:0] tl_ucp_context.c:277 TL_UCP DEBUG initialized tl context: 0x3712910
[1719264275.135478] [c318-001:1866939:0] cl_basic_context.c:50 CL_BASIC DEBUG initialized cl context: 0x3f68830
[1719264275.146446] [c318-001:1866940:0] tl_ucp_context.c:277 TL_UCP DEBUG initialized tl context: 0x1f22a60
[1719264275.146458] [c318-001:1866940:0] cl_basic_context.c:50 CL_BASIC DEBUG initialized cl context: 0x2779590
[1719264275.146559] [c318-001:1866940:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x2d76470
[1719264275.146563] [c318-001:1866940:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x2d76470
[1719264275.146565] [c318-001:1866940:0] ucc_context.c:839 UCC DEBUG created ucc context 0x276dc70 for lib CAL_UCC_
[1719264275.146559] [c318-001:1866939:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x4376060
[1719264275.146563] [c318-001:1866939:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x4376060
[1719264275.146565] [c318-001:1866939:0] ucc_context.c:839 UCC DEBUG created ucc context 0x3f5cff0 for lib CAL_UCC_
[1719264275.182000] [c318-001:1866939:0] tl_ucp_context.c:277 TL_UCP DEBUG initialized tl context: 0x41b7010
[1719264275.182013] [c318-001:1866939:0] cl_basic_context.c:39 CL_BASIC DEBUG TL cuda context is not available, skipping
[1719264275.182015] [c318-001:1866939:0] cl_basic_context.c:39 CL_BASIC DEBUG TL nccl context is not available, skipping
[1719264275.182017] [c318-001:1866939:0] cl_basic_context.c:50 CL_BASIC DEBUG initialized cl context: 0x41d6fb0
[1719264275.194596] [c318-001:1866940:0] tl_ucp_context.c:277 TL_UCP DEBUG initialized tl context: 0x29c8010
[1719264275.194607] [c318-001:1866940:0] cl_basic_context.c:39 CL_BASIC DEBUG TL cuda context is not available, skipping
[1719264275.194609] [c318-001:1866940:0] cl_basic_context.c:39 CL_BASIC DEBUG TL nccl context is not available, skipping
[1719264275.194611] [c318-001:1866940:0] cl_basic_context.c:50 CL_BASIC DEBUG initialized cl context: 0x29e7c10
[1719264275.194652] [c318-001:1866940:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x340e050
[1719264275.194655] [c318-001:1866940:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x340e050
[1719264275.194657] [c318-001:1866940:0] ucc_context.c:839 UCC DEBUG created ucc context 0x2d76940 for lib CAL_UCC_
[1719264275.194652] [c318-001:1866939:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x4b62ac0
[1719264275.194656] [c318-001:1866939:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x4b62ac0
[1719264275.194658] [c318-001:1866939:0] ucc_context.c:839 UCC DEBUG created ucc context 0x456dd30 for lib CAL_UCC_
[1719264275.194676] [c318-001:1866939:0] ucc_team.c:370 UCC DEBUG team 0x4c23050 rank 0, ctx_rank 0, map_type 3
[1719264275.194678] [c318-001:1866940:0] ucc_team.c:370 UCC DEBUG team 0x340e520 rank 1, ctx_rank 1, map_type 3
[1719264275.202804] [c318-001:1866940:0] tl_cuda_team.c:109 TL_CUDA DEBUG posted tl team: 0x3438050
[1719264275.202814] [c318-001:1866940:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x1f4fde0
[1719264275.202842] [c318-001:1866940:0] tl_cuda_team_topo.c:315 TL_CUDA DEBUG no proxy found between dev 0000:21:00.0 (0) and dev 0000:e1:00.0 (1), cuda topology is not supported
[1719264275.202849] [c318-001:1866940:0] tl_cuda_team.c:134 TL_CUDA DEBUG finalizing tl team: 0x3438050
[1719264275.203302] [c318-001:1866939:0] tl_cuda_team.c:109 TL_CUDA DEBUG posted tl team: 0x4c23880
[1719264275.203310] [c318-001:1866939:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x373fc20
[1719264275.203329] [c318-001:1866939:0] tl_cuda_team_topo.c:315 TL_CUDA DEBUG no proxy found between dev 0000:21:00.0 (0) and dev 0000:e1:00.0 (1), cuda topology is not supported
[1719264275.203335] [c318-001:1866939:0] tl_cuda_team.c:134 TL_CUDA DEBUG finalizing tl team: 0x4c23880
[1719264275.290771] [c318-001:1866940:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264275.290771] [c318-001:1866939:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264275.290790] [c318-001:1866939:0] tl_shm_team.c:158 TL_SHM DEBUG using perf params: generic
[1719264275.290845] [c318-001:1866940:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264275.290849] [c318-001:1866940:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264275.290851] [c318-001:1866940:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x34ff090
[1719264275.290853] [c318-001:1866940:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x34ff090
[1719264275.290855] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl cuda team: (-1)
[1719264275.290858] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl nccl team
[1719264275.290845] [c318-001:1866939:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264275.290849] [c318-001:1866939:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264275.290851] [c318-001:1866939:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x4cea090
[1719264275.290853] [c318-001:1866939:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x4cea090
[1719264275.290855] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl cuda team: (-1)
[1719264275.290857] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl nccl team
[1719264275.290859] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264275.290861] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264275.290862] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264275.290860] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264275.290862] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264275.290865] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264275.290893] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264275.290896] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264275.290898] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264275.290893] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264275.290896] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264275.290898] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264275.290941] [c318-001:1866939:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32768, size 2) =====
[1719264275.290951] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264275.290951] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.290951] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.290951] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.290963] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgatherv:
[1719264275.290963] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.290963] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..16383}:TL_NCCL:10 {16K..1048575}:TL_NCCL:10 {1M..inf}:TL_NCCL:10
[1719264275.290963] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.290975] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264275.290975] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_SHM:10 {4K..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.290975] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.290975] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.290986] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264275.290986] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..257}:TL_UCP:10 {258..inf}:TL_UCP:10
[1719264275.290986] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.290986] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.290997] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264275.290997] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.290997] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.290997] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.291008] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264275.291008] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.291008] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.291008] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.291015] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264275.291015] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.291015] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.291015] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.291030] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264275.291030] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.291030] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.291030] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.291043] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264275.291043] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.291043] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.291043] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.291050] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264275.291050] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.291050] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.291050] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.291062] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264275.291062] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.291062] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.291062] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.291075] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264275.291075] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.291075] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.291075] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.291089] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264275.291089] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.291089] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.291089] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.291100] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264275.291100] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.291100] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.291100] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.291113] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatter:
[1719264275.291113] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.291113] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.291122] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264275.291122] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.291122] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.291122] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.291133] [c318-001:1866939:0] ucc_team.c:474 UCC INFO ================================================
print_host_matrix : Input matrix A
11.72 0.39 0.78 0.80 0.91 0.20 0.34 0.77 0.28 0.55
0.39 10.69 0.63 0.36 0.51 0.95 0.92 0.64 0.72 0.14
0.78 0.63 4.99 0.02 0.24 0.14 0.80 0.16 0.40 0.13
0.80 0.36 0.02 7.17 1.00 0.22 0.51 0.84 0.61 0.30
0.91 0.51 0.24 1.00 7.38 0.52 0.49 0.97 0.29 0.77
0.20 0.95 0.14 0.22 0.52 5.74 0.77 0.40 0.89 0.28
0.34 0.92 0.80 0.51 0.49 0.77 4.30 0.81 0.92 0.07
0.77 0.64 0.16 0.84 0.97 0.40 0.81 3.12 0.53 0.09
0.28 0.72 0.40 0.61 0.29 0.89 0.92 0.53 1.71 0.66
0.55 0.14 0.13 0.30 0.77 0.28 0.07 0.09 0.66 1.78
print_host_matrix : Input matrix X
-nan
-nan
-nan
-nan
-nan
-nan
-nan
-nan
-nan
-nan
print_host_matrix : Input matrix B
1.00
1.00
1.00
1.00
1.00
1.00
1.00
1.00
1.00
1.00
[1719264275.680628] [c318-001:1866940:0] ucc_team.c:370 UCC DEBUG team 0x3c55480 rank 1, ctx_rank 1, map_type 3
[1719264275.680648] [c318-001:1866940:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264275.680652] [c318-001:1866940:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x17ae010
[1719264275.680650] [c318-001:1866939:0] ucc_team.c:370 UCC DEBUG team 0x5443ed0 rank 0, ctx_rank 0, map_type 3
[1719264275.680661] [c318-001:1866939:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264275.680665] [c318-001:1866939:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x3689cc0
[1719264275.680674] [c318-001:1866939:0] tl_shm_team.c:158 TL_SHM DEBUG using perf params: generic
[1719264275.689127] [c318-001:1866939:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264275.689137] [c318-001:1866939:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264275.689140] [c318-001:1866939:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x544f840
[1719264275.689141] [c318-001:1866939:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x544f840
[1719264275.689127] [c318-001:1866940:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264275.689137] [c318-001:1866940:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264275.689140] [c318-001:1866940:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x3ca6b10
[1719264275.689142] [c318-001:1866940:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x3ca6b10
[1719264275.689145] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264275.689148] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264275.689149] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264275.689152] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264275.689153] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264275.689155] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264275.689145] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264275.689148] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264275.689149] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264275.689152] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264275.689154] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264275.689155] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264275.689200] [c318-001:1866939:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32769, size 2) =====
[1719264275.689209] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264275.689209] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.689209] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.689209] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.689220] [c318-001:1866940:0] ucc_coll_score_map.c:144 UCC DEBUG coll Bcast is not supported for TL_SHM, fallback TL_UCP
[1719264275.689221] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgatherv:
[1719264275.689221] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.689221] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689221] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689231] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264275.689231] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_SHM:10 {4K..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.689231] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.689231] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.689245] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264275.689245] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..257}:TL_UCP:10 {258..inf}:TL_UCP:10
[1719264275.689245] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689245] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689263] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264275.689263] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.689263] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689263] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689272] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264275.689272] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.689272] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689272] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689283] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264275.689283] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.689283] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689283] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689293] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264275.689293] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.689293] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689293] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689306] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264275.689306] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.689306] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689306] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689319] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264275.689319] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.689319] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689319] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689328] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264275.689328] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.689328] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689328] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689339] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264275.689339] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.689339] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689339] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689354] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264275.689354] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.689354] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689354] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689366] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264275.689366] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.689366] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689366] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689379] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264275.689379] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.689379] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.689379] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.689388] [c318-001:1866939:0] ucc_team.c:474 UCC INFO ================================================
[1719264275.689592] [c318-001:1866939:0] ucc_coll_score_map.c:144 UCC DEBUG coll Bcast is not supported for TL_SHM, fallback TL_UCP
[1719264275.689641] [c318-001:1866939:0] ucc_coll_score_map.c:144 UCC DEBUG coll Bcast is not supported for TL_SHM, fallback TL_UCP
[1719264275.689663] [c318-001:1866940:0] ucc_coll_score_map.c:144 UCC DEBUG coll Bcast is not supported for TL_SHM, fallback TL_UCP
[1719264275.689697] [c318-001:1866940:0] ucc_team.c:370 UCC DEBUG team 0x3c64e90 rank 1, ctx_rank 1, map_type 3
[1719264275.689697] [c318-001:1866939:0] ucc_team.c:370 UCC DEBUG team 0x5453fa0 rank 0, ctx_rank 0, map_type 3
[1719264275.689944] [c318-001:1866940:0] tl_cuda_team.c:109 TL_CUDA DEBUG posted tl team: 0x3c65930
[1719264275.689948] [c318-001:1866940:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x164ec60
[1719264275.689955] [c318-001:1866939:0] tl_cuda_team.c:109 TL_CUDA DEBUG posted tl team: 0x5454a40
[1719264275.689959] [c318-001:1866939:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x2e429b0
[1719264275.689963] [c318-001:1866939:0] tl_cuda_team_topo.c:315 TL_CUDA DEBUG no proxy found between dev 0000:21:00.0 (0) and dev 0000:e1:00.0 (1), cuda topology is not supported
[1719264275.689957] [c318-001:1866940:0] tl_cuda_team_topo.c:315 TL_CUDA DEBUG no proxy found between dev 0000:21:00.0 (0) and dev 0000:e1:00.0 (1), cuda topology is not supported
[1719264275.689960] [c318-001:1866940:0] tl_cuda_team.c:134 TL_CUDA DEBUG finalizing tl team: 0x3c65930
[1719264275.689966] [c318-001:1866939:0] tl_cuda_team.c:134 TL_CUDA DEBUG finalizing tl team: 0x5454a40
[1719264275.690245] [c318-001:1866940:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264275.690247] [c318-001:1866939:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264275.690257] [c318-001:1866939:0] tl_shm_team.c:158 TL_SHM DEBUG using perf params: generic
[1719264275.690291] [c318-001:1866939:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264275.690294] [c318-001:1866939:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264275.690296] [c318-001:1866939:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x54ad360
[1719264275.690298] [c318-001:1866939:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x54ad360
[1719264275.690291] [c318-001:1866940:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264275.690294] [c318-001:1866940:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264275.690296] [c318-001:1866940:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x3c66850
[1719264275.690297] [c318-001:1866940:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x3c66850
[1719264275.690300] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl cuda team: (-1)
[1719264275.690303] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl nccl team
[1719264275.690305] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264275.690308] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264275.690310] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264275.690301] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl cuda team: (-1)
[1719264275.690305] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl nccl team
[1719264275.690306] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264275.690310] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264275.690312] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264275.690325] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264275.690328] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264275.690330] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264275.690326] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264275.690330] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264275.690332] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264275.690371] [c318-001:1866939:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32770, size 2) =====
[1719264275.690378] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264275.690378] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.690378] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.690378] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.690390] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgatherv:
[1719264275.690390] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.690390] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..16383}:TL_NCCL:10 {16K..1048575}:TL_NCCL:10 {1M..inf}:TL_NCCL:10
[1719264275.690390] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690399] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264275.690399] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_SHM:10 {4K..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.690399] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.690399] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.690409] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264275.690409] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..257}:TL_UCP:10 {258..inf}:TL_UCP:10
[1719264275.690409] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.690409] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690417] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264275.690417] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.690417] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.690417] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690428] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264275.690428] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.690428] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.690428] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690437] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264275.690437] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.690437] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.690437] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690449] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264275.690449] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.690449] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.690449] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.690460] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264275.690460] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.690460] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.690460] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.690469] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264275.690469] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.690469] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.690469] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690478] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264275.690478] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.690478] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.690478] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690489] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264275.690489] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.690489] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.690489] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690498] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264275.690498] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.690498] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.690498] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690514] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264275.690514] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.690514] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.690514] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.690526] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatter:
[1719264275.690526] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.690526] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690534] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264275.690534] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.690534] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.690534] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.690542] [c318-001:1866939:0] ucc_team.c:474 UCC INFO ================================================
[1719264275.690562] [c318-001:1866939:0] tl_self_team.c:19 TL_SELF DEBUG posted tl team: 0x3f6b610
[1719264275.690565] [c318-001:1866939:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x2627e50
[1719264275.690567] [c318-001:1866939:0] tl_self_team.c:42 TL_SELF DEBUG initialized tl team: 0x3f6b610
[1719264275.690569] [c318-001:1866939:0] ucc_tl.c:293 TL_SHM DEBUG team size 1 is too small, min supported 2
[1719264275.690563] [c318-001:1866940:0] tl_self_team.c:19 TL_SELF DEBUG posted tl team: 0x277d150
[1719264275.690566] [c318-001:1866940:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0xe34e50
[1719264275.690568] [c318-001:1866940:0] tl_self_team.c:42 TL_SELF DEBUG initialized tl team: 0x277d150
[1719264275.690570] [c318-001:1866940:0] ucc_tl.c:293 TL_SHM DEBUG team size 1 is too small, min supported 2
[1719264275.690575] [c318-001:1866940:0] ucc_tl.c:293 TL_UCP DEBUG team size 1 is too small, min supported 2
[1719264275.690577] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl self team
[1719264275.690579] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl shm team: (-1)
[1719264275.690571] [c318-001:1866939:0] ucc_tl.c:293 TL_UCP DEBUG team size 1 is too small, min supported 2
[1719264275.690576] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl self team
[1719264275.690578] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl shm team: (-1)
[1719264275.690580] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl ucp team: (-1)
[1719264275.690588] [c318-001:1866939:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32771, size 1) =====
[1719264275.690593] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264275.690582] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl ucp team: (-1)
[1719264275.690591] [c318-001:1866940:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32771, size 1) =====
[1719264275.690593] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690593] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690593] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690593] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690593] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690607] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgatherv:
[1719264275.690607] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690607] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690607] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690607] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690607] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690622] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264275.690622] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690601] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264275.690601] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690601] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690601] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690601] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690601] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690623] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Allgatherv:
[1719264275.690623] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690623] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690623] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690622] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690622] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690622] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690622] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690650] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264275.690650] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690650] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690650] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690650] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690650] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690674] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264275.690674] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690674] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690674] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690674] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690674] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690623] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690623] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690652] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264275.690652] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690652] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690652] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690652] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690652] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690679] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264275.690679] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690679] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690679] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690679] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690679] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690703] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264275.690703] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690703] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690695] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264275.690695] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690695] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690695] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690695] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690695] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690721] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264275.690721] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690721] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690721] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690721] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690721] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690740] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264275.690740] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690740] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690740] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690740] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690740] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690758] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264275.690758] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690758] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO [1719264275.690703] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690703] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690703] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690725] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264275.690725] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690725] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690725] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690725] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690725] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690755] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264275.690755] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690755] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690755] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690755] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
Cuda: {0..inf}:TL_SELF:10
[1719264275.690758] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690758] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690758] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690774] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264275.690774] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690774] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690774] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690774] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690774] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690808] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264275.690808] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690808] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690808] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690808] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690808] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690831] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264275.690831] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690831] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690831] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690831] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690831] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690861] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264275.690755] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690784] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264275.690784] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690784] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690784] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690784] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690784] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690810] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264275.690810] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690810] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690810] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690810] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690810] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690835] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264275.690835] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690835] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690835] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690835] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690835] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690864] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264275.690864] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690864] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690864] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO [1719264275.690861] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690861] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690861] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690861] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690861] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690890] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264275.690890] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690890] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690890] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690890] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690890] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690922] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatter:
[1719264275.690922] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690922] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690922] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690922] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690922] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690937] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264275.690937] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690937] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690937] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690937] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690864] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690864] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690894] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264275.690894] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690894] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690894] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690894] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690894] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690916] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264275.690916] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690916] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690916] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690916] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690916] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690937] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264275.690937] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690937] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690937] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690937] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690937] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690962] [c318-001:1866939:0] ucc_team.c:474 UCC INFO ================================================
[1719264275.691029] [c318-001:1866939:0] ucc_team.c:370 UCC DEBUG team 0x54b6190 rank 0, ctx_rank 0, map_type 3
[1719264275.690937] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690967] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Scatter:
[1719264275.690967] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690967] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690967] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690967] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690967] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.690999] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264275.690999] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.690999] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.690999] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.690999] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.690999] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691019] [c318-001:1866940:0] ucc_team.c:474 UCC INFO ================================================
[1719264275.691029] [c318-001:1866940:0] ucc_team.c:370 UCC DEBUG team 0x3cc7570 rank 1, ctx_rank 1, map_type 3
[1719264275.691157] [c318-001:1866940:0] tl_cuda_team.c:109 TL_CUDA DEBUG posted tl team: 0x3cc8010
[1719264275.691161] [c318-001:1866940:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x948190
[1719264275.691173] [c318-001:1866939:0] tl_cuda_team.c:109 TL_CUDA DEBUG posted tl team: 0x54b6c30
[1719264275.691178] [c318-001:1866939:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x213b190
[1719264275.691175] [c318-001:1866940:0] tl_cuda_team_topo.c:315 TL_CUDA DEBUG no proxy found between dev 0000:21:00.0 (0) and dev 0000:e1:00.0 (1), cuda topology is not supported
[1719264275.691179] [c318-001:1866940:0] tl_cuda_team.c:134 TL_CUDA DEBUG finalizing tl team: 0x3cc8010
[1719264275.691183] [c318-001:1866939:0] tl_cuda_team_topo.c:315 TL_CUDA DEBUG no proxy found between dev 0000:21:00.0 (0) and dev 0000:e1:00.0 (1), cuda topology is not supported
[1719264275.691188] [c318-001:1866939:0] tl_cuda_team.c:134 TL_CUDA DEBUG finalizing tl team: 0x54b6c30
[1719264275.691391] [c318-001:1866940:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264275.691392] [c318-001:1866939:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264275.691398] [c318-001:1866939:0] tl_shm_team.c:158 TL_SHM DEBUG using perf params: generic
[1719264275.691429] [c318-001:1866940:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264275.691432] [c318-001:1866940:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264275.691434] [c318-001:1866940:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x3cc8f50
[1719264275.691436] [c318-001:1866940:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x3cc8f50
[1719264275.691438] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl cuda team: (-1)
[1719264275.691429] [c318-001:1866939:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264275.691432] [c318-001:1866939:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264275.691433] [c318-001:1866939:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x54b7b70
[1719264275.691435] [c318-001:1866939:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x54b7b70
[1719264275.691436] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl cuda team: (-1)
[1719264275.691438] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl nccl team
[1719264275.691440] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl nccl team
[1719264275.691443] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264275.691445] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264275.691446] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264275.691440] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264275.691443] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264275.691444] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264275.691458] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264275.691460] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264275.691463] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264275.691459] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264275.691461] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264275.691463] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264275.691501] [c318-001:1866939:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32772, size 2) =====
[1719264275.691507] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264275.691507] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.691507] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.691507] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.691520] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgatherv:
[1719264275.691520] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.691520] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..16383}:TL_NCCL:10 {16K..1048575}:TL_NCCL:10 {1M..inf}:TL_NCCL:10
[1719264275.691520] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691533] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264275.691533] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_SHM:10 {4K..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.691533] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.691533] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_NCCL:10 {4K..inf}:TL_NCCL:10
[1719264275.691548] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264275.691548] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..257}:TL_UCP:10 {258..inf}:TL_UCP:10
[1719264275.691548] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.691548] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691555] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264275.691555] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.691555] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.691555] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691565] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264275.691565] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.691565] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.691565] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691576] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264275.691576] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.691576] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.691576] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691584] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264275.691584] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.691584] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.691584] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.691595] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264275.691595] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.691595] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.691595] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.691607] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264275.691607] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.691607] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.691607] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691618] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264275.691618] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.691618] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.691618] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691636] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264275.691636] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.691636] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.691636] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691652] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264275.691652] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.691652] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.691652] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691664] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264275.691664] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.691664] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.691664] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.691672] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatter:
[1719264275.691672] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.691672] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691677] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264275.691677] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.691677] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_NCCL:10
[1719264275.691677] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_NCCL:10
[1719264275.691690] [c318-001:1866939:0] ucc_team.c:474 UCC INFO ================================================
[1719264275.691706] [c318-001:1866940:0] tl_self_team.c:19 TL_SELF DEBUG posted tl team: 0x1352780
[1719264275.691708] [c318-001:1866940:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x3cc9aa0
[1719264275.691710] [c318-001:1866940:0] tl_self_team.c:42 TL_SELF DEBUG initialized tl team: 0x1352780
[1719264275.691713] [c318-001:1866940:0] ucc_tl.c:293 TL_SHM DEBUG team size 1 is too small, min supported 2
[1719264275.691717] [c318-001:1866940:0] ucc_tl.c:293 TL_UCP DEBUG team size 1 is too small, min supported 2
[1719264275.691719] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl self team
[1719264275.691721] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl shm team: (-1)
[1719264275.691724] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl ucp team: (-1)
[1719264275.691735] [c318-001:1866940:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32773, size 1) =====
[1719264275.691740] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264275.691740] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691740] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691740] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691740] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691740] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691759] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Allgatherv:
[1719264275.691759] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691759] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691759] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691759] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691759] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691769] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264275.691769] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691769] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691769] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691769] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691769] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691783] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264275.691783] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691783] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691783] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691783] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691783] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691800] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264275.691800] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691800] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691800] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691800] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691800] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691815] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264275.691815] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691815] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691815] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691815] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691815] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691837] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264275.691837] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691837] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691837] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691837] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691837] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691862] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264275.691862] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691862] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691862] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691862] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691862] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691877] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264275.691877] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691877] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691877] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691877] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691877] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691900] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264275.691900] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691900] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691900] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691900] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691900] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691917] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264275.691917] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691917] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691917] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691917] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691917] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691935] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264275.691935] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691935] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691935] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691935] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691935] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691949] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264275.691949] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691949] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691949] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691949] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691949] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO [1719264275.691958] [c318-001:1866939:0] tl_self_team.c:19 TL_SELF DEBUG posted tl team: 0x3f6b4a0
[1719264275.691965] [c318-001:1866939:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x54b89a0
[1719264275.691972] [c318-001:1866939:0] tl_self_team.c:42 TL_SELF DEBUG initialized tl team: 0x3f6b4a0
[1719264275.691975] [c318-001:1866939:0] ucc_tl.c:293 TL_SHM DEBUG team size 1 is too small, min supported 2
[1719264275.691978] [c318-001:1866939:0] ucc_tl.c:293 TL_UCP DEBUG team size 1 is too small, min supported 2
[1719264275.691981] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl self team
RocmManaged: {0..inf}:TL_SELF:10
[1719264275.691979] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264275.691979] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.691979] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.691979] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.691979] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.691979] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692000] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Scatter:
[1719264275.692000] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692000] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO [1719264275.691984] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl shm team: (-1)
[1719264275.691989] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl ucp team: (-1)
[1719264275.691996] [c318-001:1866939:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32773, size 1) =====
[1719264275.692002] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264275.692002] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
Cuda: {0..inf}:TL_SELF:10
[1719264275.692000] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692000] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692000] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692030] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264275.692030] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692030] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692030] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692030] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692030] [c318-001:1866940:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692049] [c318-001:1866940:0] ucc_team.c:474 UCC INFO ================================================
[1719264275.692002] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692002] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692002] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692002] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692039] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgatherv:
[1719264275.692039] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692039] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692039] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692039] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692039] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692057] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264275.692057] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692057] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692057] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692057] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692057] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692074] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264275.692074] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692074] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692074] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692074] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692074] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692097] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264275.692097] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692097] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692097] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692097] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692097] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692114] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264275.692114] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692114] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692114] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692114] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692114] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692130] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264275.692130] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692130] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692130] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692130] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692130] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692151] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264275.692151] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692151] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692151] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692151] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692151] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692165] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264275.692165] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692165] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692165] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692165] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692165] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692179] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264275.692179] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692179] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692179] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692179] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692179] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692198] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264275.692198] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692198] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692198] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692198] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692198] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692216] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264275.692216] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692216] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692216] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692216] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692216] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692234] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264275.692234] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692234] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692234] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692234] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692234] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692245] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264275.692245] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692245] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692245] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692245] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692245] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692262] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatter:
[1719264275.692262] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692262] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692262] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692262] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692262] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692278] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264275.692278] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SELF:10
[1719264275.692278] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_SELF:10
[1719264275.692278] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_SELF:10
[1719264275.692278] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Rocm: {0..inf}:TL_SELF:10
[1719264275.692278] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO RocmManaged: {0..inf}:TL_SELF:10
[1719264275.692296] [c318-001:1866939:0] ucc_team.c:474 UCC INFO ================================================
[1719264275.692309] [c318-001:1866940:0] ucc_team.c:370 UCC DEBUG team 0x3cd1ea0 rank 1, ctx_rank 1, map_type 3
[1719264275.692312] [c318-001:1866940:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264275.692313] [c318-001:1866939:0] ucc_team.c:370 UCC DEBUG team 0x54c0df0 rank 0, ctx_rank 0, map_type 3
[1719264275.692316] [c318-001:1866939:0] ucc_tl.c:299 TL_SELF DEBUG team size 2 is too big, max supported 1
[1719264275.692318] [c318-001:1866939:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x54c1260
[1719264275.692322] [c318-001:1866939:0] tl_shm_team.c:158 TL_SHM DEBUG using perf params: generic
[1719264275.692314] [c318-001:1866940:0] cl_basic_team.c:52 CL_BASIC DEBUG posted cl team: 0x3cd2310
[1719264275.692355] [c318-001:1866940:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264275.692358] [c318-001:1866940:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264275.692359] [c318-001:1866940:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x3cd27b0
[1719264275.692360] [c318-001:1866940:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x3cd27b0
[1719264275.692355] [c318-001:1866939:0] tl_ucp_team.c:84 UCC DEBUG section not found
[1719264275.692358] [c318-001:1866939:0] tl_ucp_team.c:100 TL_UCP DEBUG opt knomial radix: 2
[1719264275.692359] [c318-001:1866939:0] tl_ucp_team.c:103 TL_UCP DEBUG posted tl team: 0x54c1700
[1719264275.692361] [c318-001:1866939:0] tl_ucp_team.c:202 TL_UCP DEBUG initialized tl team: 0x54c1700
[1719264275.692363] [c318-001:1866939:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264275.692366] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264275.692368] [c318-001:1866939:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264275.692363] [c318-001:1866940:0] cl_basic_team.c:126 CL_BASIC DEBUG failed to create tl self team: (-1)
[1719264275.692367] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl shm team
[1719264275.692370] [c318-001:1866940:0] cl_basic_team.c:122 CL_BASIC DEBUG initialized tl ucp team
[1719264275.692374] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264275.692376] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264275.692377] [c318-001:1866940:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264275.692372] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type host
[1719264275.692375] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda
[1719264275.692377] [c318-001:1866939:0] tl_ucp_team.c:232 TL_UCP DEBUG enable support for memory type cuda-managed
[1719264275.692412] [c318-001:1866939:0] ucc_team.c:472 UCC INFO ===== COLL_SCORE_MAP (team_id 32774, size 2) =====
[1719264275.692418] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgather:
[1719264275.692418] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.692418] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.692418] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.692427] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allgatherv:
[1719264275.692427] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.692427] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692427] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692441] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Allreduce:
[1719264275.692441] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..4095}:TL_SHM:10 {4K..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.692441] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.692441] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..4095}:TL_UCP:10 {4K..inf}:TL_UCP:10
[1719264275.692450] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoall:
[1719264275.692450] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..257}:TL_UCP:10 {258..inf}:TL_UCP:10
[1719264275.692450] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692450] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692461] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Alltoallv:
[1719264275.692461] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.692461] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692461] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692470] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Barrier:
[1719264275.692470] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.692470] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692470] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692482] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Bcast:
[1719264275.692482] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.692482] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692482] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692495] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanin:
[1719264275.692495] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.692495] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692495] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692508] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Fanout:
[1719264275.692508] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_SHM:10
[1719264275.692508] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692508] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692520] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gather:
[1719264275.692520] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.692520] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692520] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692530] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Gatherv:
[1719264275.692530] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.692530] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692530] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692544] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce:
[1719264275.692544] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..8K}:TL_SHM:10 {8193..inf}:TL_UCP:10
[1719264275.692544] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692544] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692557] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatter:
[1719264275.692557] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.692557] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692557] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692571] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Reduce_scatterv:
[1719264275.692571] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.692571] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692571] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692579] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Scatterv:
[1719264275.692579] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Host: {0..inf}:TL_UCP:10
[1719264275.692579] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO Cuda: {0..inf}:TL_UCP:10
[1719264275.692579] [c318-001:1866939:0] ucc_coll_score_map.c:206 UCC INFO CudaManaged: {0..inf}:TL_UCP:10
[1719264275.692591] [c318-001:1866939:0] ucc_team.c:474 UCC INFO ================================================
[1719264276.679172] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264276.679175] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264276.680274] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264276.680277] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264276.886054] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264276.886053] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264276.886130] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264276.886133] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.049606] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.049610] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.049668] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.049671] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.212885] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.212899] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.212956] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.212959] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.213000] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.213042] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.213053] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.213277] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.213321] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.213339] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.213383] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.213385] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.213392] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217096] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217142] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217144] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217169] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217170] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217200] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217204] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217234] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217238] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217277] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217279] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217294] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217312] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217355] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217361] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217357] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217370] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217415] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217417] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217420] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217444] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217586] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217588] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217601] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217604] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217630] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217632] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217659] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217663] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217697] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217703] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217700] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217720] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217764] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217767] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217770] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217786] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217914] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217918] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217929] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217933] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.217958] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.217961] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.217961] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.217993] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.218026] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.218028] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.218031] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.218042] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.218181] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.218184] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.218195] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.218200] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.218224] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.218228] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.546848] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.546848] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.591765] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.591766] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.725672] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.726510] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.770714] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.770717] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.770779] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.770807] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.770867] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.770870] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.770884] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.770894] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.770950] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.770954] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.771989] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.772203] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.772247] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.772250] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.772273] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.787313] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.787358] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.787362] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.787384] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.787644] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.787704] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.787707] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.787721] [c318-001:1866939:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14b095201080
[1719264277.787726] [c318-001:1866940:0] tl_nccl_team.c:293 TL_NCCL DEBUG init coll task 0x14f5a5201080
[1719264277.787796] [c318-001:1866940:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14f5a5201080
[1719264277.787799] [c318-001:1866939:0] tl_nccl_coll.c:219 TL_NCCL DEBUG finalizing coll task 0x14b095201080
[1719264277.787851] [c318-001:1866940:0] ucc_coll_score_map.c:144 UCC DEBUG coll Bcast is not supported for TL_SHM, fallback TL_UCP
[1719264277.787893] [c318-001:1866939:0] ucc_coll_score_map.c:144 UCC DEBUG coll Bcast is not supported for TL_SHM, fallback TL_UCP
print_host_matrix : Output matrix X
0.03
0.04
0.14
0.07
0.03
0.08
0.09
0.20
0.19
0.42
|b - A*x|_inf = 2.220446E-16
|x|_inf = 1.273718E+00
|b|_inf = 1.000000E+01
|A|_inf = 1.674133E+01
|b - A*x|/(|A|*|x|+|b|) = 7.088702E-18
[1719264277.788895] [c318-001:1866940:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0xe62990
[1719264277.788903] [c318-001:1866940:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x3ca6b10
[1719264277.788912] [c318-001:1866940:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x17ae010
[1719264277.788975] [c318-001:1866939:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0x2655990
[1719264277.788982] [c318-001:1866939:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x544f840
[1719264277.788988] [c318-001:1866939:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x3689cc0
[1719264277.788992] [c318-001:1866939:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32769
[1719264277.812779] [c318-001:1866939:0] tl_nccl_team.c:65 TL_NCCL DEBUG finalizing tl team: 0x26524d0
[1719264277.812799] [c318-001:1866939:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0x2655760
[1719264277.812805] [c318-001:1866939:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x54ad360
[1719264277.812812] [c318-001:1866939:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x2e429b0
[1719264277.812817] [c318-001:1866939:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32770
[1719264277.812833] [c318-001:1866939:0] tl_self_team.c:25 TL_SELF DEBUG finalizing tl team: 0x3f6b610
[1719264277.812837] [c318-001:1866939:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x2627e50
[1719264277.812839] [c318-001:1866939:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32771
[1719264277.835520] [c318-001:1866939:0] tl_nccl_team.c:65 TL_NCCL DEBUG finalizing tl team: 0x54b6c30
[1719264277.835537] [c318-001:1866939:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0x2655530
[1719264277.835543] [c318-001:1866939:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x54b7b70
[1719264277.835549] [c318-001:1866939:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x213b190
[1719264277.835552] [c318-001:1866939:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32772
[1719264277.835562] [c318-001:1866939:0] tl_self_team.c:25 TL_SELF DEBUG finalizing tl team: 0x3f6b4a0
[1719264277.835566] [c318-001:1866939:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x54b89a0
[1719264277.835569] [c318-001:1866939:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32773
[1719264277.835581] [c318-001:1866939:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0x2655250
[1719264277.835584] [c318-001:1866939:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x54c1700
[1719264277.835587] [c318-001:1866939:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x54c1260
[1719264277.835591] [c318-001:1866939:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32774
[1719264278.020109] [c318-001:1866940:0] tl_nccl_team.c:65 TL_NCCL DEBUG finalizing tl team: 0xe5f4d0
[1719264278.020144] [c318-001:1866940:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0xe62760
[1719264278.020153] [c318-001:1866940:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x3c66850
[1719264278.020162] [c318-001:1866940:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x164ec60
[1719264278.020182] [c318-001:1866940:0] tl_self_team.c:25 TL_SELF DEBUG finalizing tl team: 0x277d150
[1719264278.020187] [c318-001:1866940:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0xe34e50
[1719264278.020189] [c318-001:1866940:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32771
[1719264278.024440] [c318-001:1866940:0] tl_nccl_team.c:65 TL_NCCL DEBUG finalizing tl team: 0x3cc8010
[1719264278.024459] [c318-001:1866940:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0xe62530
[1719264278.024464] [c318-001:1866940:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x3cc8f50
[1719264278.024469] [c318-001:1866940:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x948190
[1719264278.024479] [c318-001:1866940:0] tl_self_team.c:25 TL_SELF DEBUG finalizing tl team: 0x1352780
[1719264278.024483] [c318-001:1866940:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x3cc9aa0
[1719264278.024486] [c318-001:1866940:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32773
[1719264278.024502] [c318-001:1866940:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0xe62250
[1719264278.024507] [c318-001:1866940:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x3cd27b0
[1719264278.024513] [c318-001:1866940:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x3cd2310
[1719264278.031646] [c318-001:1866940:0] tl_nccl_team.c:65 TL_NCCL DEBUG finalizing tl team: 0x1e99e10
[1719264278.031662] [c318-001:1866940:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0x277a490
[1719264278.031666] [c318-001:1866940:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x34ff090
[1719264278.031672] [c318-001:1866940:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x1f4fde0
[1719264278.031683] [c318-001:1866940:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x340e050
[1719264278.031688] [c318-001:1866940:0] cl_basic_context.c:57 CL_BASIC DEBUG finalizing cl context: 0x29e7c10
[1719264278.031694] [c318-001:1866940:0] tl_self_context.c:39 TL_SELF DEBUG finalizing tl context: 0x2d76ab0
[1719264278.031710] [c318-001:1866940:0] tl_shm_context.c:39 TL_SHM DEBUG finalizing tl context: 0x2d76bc0
[1719264278.031715] [c318-001:1866940:0] tl_ucp_context.c:380 TL_UCP DEBUG finalizing tl context: 0x29c8010
[1719264278.032496] [c318-001:1866939:0] tl_nccl_team.c:65 TL_NCCL DEBUG finalizing tl team: 0x373fad0
[1719264278.032524] [c318-001:1866939:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0x3f69730
[1719264278.032531] [c318-001:1866939:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x4cea090
[1719264278.032536] [c318-001:1866939:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x373fc20
[1719264278.032539] [c318-001:1866939:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32768
[1719264278.032552] [c318-001:1866939:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x4b62ac0
[1719264278.032556] [c318-001:1866939:0] cl_basic_context.c:57 CL_BASIC DEBUG finalizing cl context: 0x41d6fb0
[1719264278.032563] [c318-001:1866939:0] tl_self_context.c:39 TL_SELF DEBUG finalizing tl context: 0x456dea0
[1719264278.032573] [c318-001:1866939:0] tl_shm_context.c:39 TL_SHM DEBUG finalizing tl context: 0x4376510
[1719264278.032578] [c318-001:1866939:0] tl_ucp_context.c:380 TL_UCP DEBUG finalizing tl context: 0x41b7010
[1719264278.051167] [c318-001:1866939:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x4376060
[1719264278.051177] [c318-001:1866939:0] cl_basic_context.c:57 CL_BASIC DEBUG finalizing cl context: 0x3f68830
[1719264278.051183] [c318-001:1866939:0] tl_cuda_context.c:81 TL_CUDA DEBUG finalizing tl context: 0x39429d0
[1719264278.051192] [c318-001:1866939:0] tl_nccl_context.c:211 TL_NCCL DEBUG finalizing tl context: 0x3736120
[1719264278.051201] [c318-001:1866940:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x2d76470
[1719264278.051210] [c318-001:1866940:0] cl_basic_context.c:57 CL_BASIC DEBUG finalizing cl context: 0x2779590
[1719264278.051217] [c318-001:1866940:0] tl_cuda_context.c:81 TL_CUDA DEBUG finalizing tl context: 0x2152ef0
[1719264278.051223] [c318-001:1866940:0] tl_nccl_context.c:211 TL_NCCL DEBUG finalizing tl context: 0x1f46270
[1719264278.051566] [c318-001:1866939:0] tl_self_context.c:39 TL_SELF DEBUG finalizing tl context: 0x3f5e8d0
[1719264278.051572] [c318-001:1866939:0] tl_shm_context.c:39 TL_SHM DEBUG finalizing tl context: 0x3f5ea00
[1719264278.051577] [c318-001:1866939:0] tl_ucp_context.c:380 TL_UCP DEBUG finalizing tl context: 0x3712910
[1719264278.051697] [c318-001:1866940:0] tl_self_context.c:39 TL_SELF DEBUG finalizing tl context: 0x276f560
[1719264278.051703] [c318-001:1866940:0] tl_shm_context.c:39 TL_SHM DEBUG finalizing tl context: 0x276f690
[1719264278.051707] [c318-001:1866940:0] tl_ucp_context.c:380 TL_UCP DEBUG finalizing tl context: 0x1f22a60
[1719264278.068473] [c318-001:1866939:0] tl_cuda_lib.c:41 TL_CUDA DEBUG finalizing lib object: 0x3c9eed0
[1719264278.068480] [c318-001:1866939:0] tl_nccl_lib.c:22 TL_NCCL DEBUG finalizing lib object: 0x3f5c3f0
[1719264278.068484] [c318-001:1866939:0] tl_self_lib.c:26 TL_SELF DEBUG finalizing lib object: 0x3c89030
[1719264278.068488] [c318-001:1866939:0] tl_shm_lib.c:30 TL_SHM DEBUG finalizing lib object: 0x3f28bb0
[1719264278.068493] [c318-001:1866939:0] tl_ucp_lib.c:83 TL_UCP DEBUG finalizing lib object: 0x3f28e70
[1719264278.068497] [c318-001:1866939:0] cl_basic_lib.c:26 CL_BASIC DEBUG finalizing lib object: 0x37479f0
[1719264278.068508] [c318-001:1866940:0] tl_cuda_lib.c:41 TL_CUDA DEBUG finalizing lib object: 0x24afb40
[1719264278.068515] [c318-001:1866940:0] tl_nccl_lib.c:22 TL_NCCL DEBUG finalizing lib object: 0x2739f40
[1719264278.068519] [c318-001:1866940:0] tl_self_lib.c:26 TL_SELF DEBUG finalizing lib object: 0x24999d0
[1719264278.068523] [c318-001:1866940:0] tl_shm_lib.c:30 TL_SHM DEBUG finalizing lib object: 0x2739b40
[1719264278.068527] [c318-001:1866940:0] tl_ucp_lib.c:83 TL_UCP DEBUG finalizing lib object: 0x27661c0
[1719264278.068530] [c318-001:1866940:0] cl_basic_lib.c:26 CL_BASIC DEBUG finalizing lib object: 0x1f57f10
[1719264278.069623] [c318-001:1866939:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0x2655be0
[1719264278.069630] [c318-001:1866939:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x32b1de0
[1719264278.069636] [c318-001:1866939:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x3203d60
[1719264278.069640] [c318-001:1866939:0] ucc_team.c:528 UCC INFO team destroyed, team_id 32768
[1719264278.069647] [c318-001:1866939:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x2e107b0
[1719264278.069651] [c318-001:1866939:0] cl_basic_context.c:57 CL_BASIC DEBUG finalizing cl context: 0x2b512c0
[1719264278.069655] [c318-001:1866939:0] cl_hier_context.c:75 CL_HIER DEBUG finalizing cl context: 0x2b4aad0
[1719264278.069664] [c318-001:1866939:0] tl_self_context.c:39 TL_SELF DEBUG finalizing tl context: 0x2b47350
[1719264278.069667] [c318-001:1866939:0] tl_shm_context.c:39 TL_SHM DEBUG finalizing tl context: 0x2b47440
[1719264278.069671] [c318-001:1866939:0] tl_ucp_context.c:380 TL_UCP DEBUG finalizing tl context: 0x2b0ebb0
[1719264278.069773] [c318-001:1866940:0] tl_shm_team.c:534 TL_SHM DEBUG finalizing tl team: 0xe62be0
[1719264278.069780] [c318-001:1866940:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x1ad0fc0
[1719264278.069787] [c318-001:1866940:0] cl_basic_team.c:61 CL_BASIC DEBUG finalizing cl team: 0x16c91f0
[1719264278.069796] [c318-001:1866940:0] tl_ucp_team.c:110 TL_UCP DEBUG finalizing tl team: 0x161c7a0
[1719264278.069800] [c318-001:1866940:0] cl_basic_context.c:57 CL_BASIC DEBUG finalizing cl context: 0x1354f80
[1719264278.069804] [c318-001:1866940:0] cl_hier_context.c:75 CL_HIER DEBUG finalizing cl context: 0x1349230
[1719264278.069810] [c318-001:1866940:0] tl_self_context.c:39 TL_SELF DEBUG finalizing tl context: 0x1345bc0
[1719264278.069813] [c318-001:1866940:0] tl_shm_context.c:39 TL_SHM DEBUG finalizing tl context: 0x1345cb0
[1719264278.069817] [c318-001:1866940:0] tl_ucp_context.c:380 TL_UCP DEBUG finalizing tl context: 0x12f9850
[1719264278.086467] [c318-001:1866939:0] tl_cuda_lib.c:41 TL_CUDA DEBUG finalizing lib object: 0x2ad4c30
[1719264278.086476] [c318-001:1866939:0] tl_self_lib.c:26 TL_SELF DEBUG finalizing lib object: 0x2b20130
[1719264278.086479] [c318-001:1866939:0] tl_shm_lib.c:30 TL_SHM DEBUG finalizing lib object: 0x2b46e20
[1719264278.086482] [c318-001:1866939:0] tl_ucp_lib.c:83 TL_UCP DEBUG finalizing lib object: 0x2516e20
[1719264278.086486] [c318-001:1866939:0] cl_basic_lib.c:26 CL_BASIC DEBUG finalizing lib object: 0x2b46040
[1719264278.086491] [c318-001:1866939:0] cl_hier_lib.c:61 CL_HIER DEBUG finalizing lib object: 0x27d5c20
[1719264278.086565] [c318-001:1866940:0] tl_cuda_lib.c:41 TL_CUDA DEBUG finalizing lib object: 0x12f9450
[1719264278.086572] [c318-001:1866940:0] tl_self_lib.c:26 TL_SELF DEBUG finalizing lib object: 0x12fbd90
[1719264278.086576] [c318-001:1866940:0] tl_shm_lib.c:30 TL_SHM DEBUG finalizing lib object: 0x1345690
[1719264278.086579] [c318-001:1866940:0] tl_ucp_lib.c:83 TL_UCP DEBUG finalizing lib object: 0xd23e20
[1719264278.086582] [c318-001:1866940:0] cl_basic_lib.c:26 CL_BASIC DEBUG finalizing lib object: 0x13448b0
[1719264278.086585] [c318-001:1866940:0] cl_hier_lib.c:61 CL_HIER DEBUG finalizing lib object: 0xfe2ac0
I think it is the unsupported backend thing that you mentioned.
I do not see the RDMA error here. So if you run with UCC_TLS=^mlx5,sharp
but without UCC_LOG_LEVEL=DEBUG
, the output looks clean?
Yes, the output is clean (aside from those warnings about "initialized with different thread mode"). Thanks for your help.
I am trying to set up cuSOLVERMp on my HPC system (Lonestar 6 at TACC). I loaded the HPC SDK module
I successfully compiled the cuSOLVERMp examples with
make
I then ran one of the examples and got the following errors. The other examples show similar behavior.
Aside from the error in the middle, it looks like the program ran successfully, but the verbose output shows some NaNs were present in the computations.
If I try running with
-np 3
, the program hangs after the[c306-001][Jun 19 13:32:18 188430][GENERAL][2563496][error] - unable to query AM service record(AM query)
line.I would appreciate any help to resolve this.