issues
search
NVIDIA
/
nccl-tests
NCCL Tests
BSD 3-Clause "New" or "Revised" License
809
stars
229
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
all_reduce_perf fails on 2 nodes
#150
scvance
closed
1 year ago
2
Expected bandwidth results? 8x A100 GPUs over NVLink
#149
acgandhi
opened
1 year ago
10
test error: stuck when run test example
#148
zhengmq2010
opened
1 year ago
4
src/Makefile: remove unused variables
#147
yangxingwu
opened
1 year ago
0
makefile: remove extra space
#146
yangxingwu
closed
1 year ago
0
[91mnvcc fatal : Unsupported gpu architecture 'compute_35' [0m[91mmake[1]: *** [Makefile:84: ../build/all_reduce.o] Error 1 for nvcr.io/nvidia/pytorch:23.02-py3
#145
monajalal
closed
1 year ago
2
question regarding versioning
#144
monajalal
closed
1 year ago
0
Bus error when using 16 GPUs in one node
#143
richardsliu
closed
1 year ago
7
why add ALIGN in allgather/reducescatter/hypercube
#142
ziyueSeo
closed
1 year ago
0
Multi-node test within a docker container
#141
deepakn94
opened
1 year ago
1
the tag v2.12.10 is missing
#140
terryhy520
opened
1 year ago
1
All2All Benchmarks on Perlmutter
#139
caoshiyi
opened
1 year ago
8
Csv format
#138
lipovsek-aws
opened
1 year ago
3
nccl-tests ignores NCCL_HOME if there exists system wide installation in /usr
#137
nishshah0
opened
1 year ago
6
No commuication between two nodes
#136
GongZhengLi
closed
1 year ago
2
fix handling of variable NVCC.
#135
aavbsouza
closed
1 year ago
0
Update README.md
#134
flx42
closed
1 year ago
0
Running nccl-test on two nodes failed
#133
zhangciba
opened
1 year ago
1
unable to complete a TCP connection to another process
#132
odellus
opened
1 year ago
5
./all_reduce_perf: "Out of bounds values: 50 FAILED" [2 GPUs, PHB]
#131
Meriipu
opened
1 year ago
12
algorithm bandwidth of all2all
#130
de1star
closed
1 year ago
2
"alias must point to a defined variable or function"
#129
rainwoodman
opened
1 year ago
1
nccl-tests on different GPUs
#128
de1star
closed
1 year ago
11
Test bench
#127
novaCoder-zrk
opened
1 year ago
0
Align print format string for column names and units
#126
dmitrygx
opened
1 year ago
5
Failure when more than 2 GPUs in each node
#125
dogacancolak
closed
1 year ago
5
ArchLinux test Failed
#124
jacklu333333
opened
1 year ago
3
Understanding the latency of NCCL
#123
ConnollyLeon
opened
1 year ago
2
Update getHostHash() to avoid hash conflict
#122
dong0321
closed
1 year ago
3
Multi-Node Launch
#121
apoorvemohan
opened
1 year ago
1
Option to output results in csv and json format
#120
avolkov1
opened
1 year ago
11
Evaluation of NCCL test result
#119
Yujaeseo
closed
1 year ago
2
nccl-test result (error field)
#118
susol-hjkim
opened
1 year ago
1
NCCL all_reduce_perf test hangs with multiple RTX 4090 GPUs, works fine when I swap in 2080tis
#117
RCS1
closed
1 year ago
47
Update README.md
#116
BlueCloudDev
closed
1 year ago
4
The multi-gpu tests always hang and NCCL cannot find CUDA
#115
SusuXu
opened
2 years ago
5
Does not compile with NVHPC 22.7
#114
zyndagj
closed
2 years ago
2
Support setting CUDA_VISIBLE_DEVICES env variable
#113
ryanamazon
opened
2 years ago
7
nccl test only gets ~65% of the link bandwidth
#112
sandyhouse
closed
2 years ago
10
nccl test failed with mpirun for two machines
#111
sandyhouse
closed
2 years ago
6
The size of grid and block seems mismatch
#110
ihchoi12
opened
2 years ago
2
Do ranks on multiple nodes participate in ops or is the test standalone?
#109
MrAta
closed
2 years ago
1
Test failure when NCCL_MIN_NCHANNELS is set to a value other than 2
#108
ihchoi12
opened
2 years ago
2
How to understand the result?
#107
ihchoi12
opened
2 years ago
8
Inconsistent all_reduce busbw between 2 nodes
#106
zhengwy888
opened
2 years ago
9
Support setting CUDA_VISIBLE_DEVICES env variable
#105
nzmsv
closed
2 years ago
4
Got different results on same devices and same tests
#104
HaoKang-Timmy
closed
2 years ago
2
where is mpi.h
#103
ShivanshuPurohit
opened
2 years ago
0
Multiple node NCCL tests hang
#102
aamcintosh
opened
2 years ago
3
Profiling all_reduce_perf with Nsight hangs
#101
caogao
opened
2 years ago
1
Previous
Next