issues
search
aws
/
aws-ofi-nccl
This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.
Apache License 2.0
140
stars
54
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
feat(rdma): constrain C linkage to init
#591
aws-nslick
closed
5 days ago
2
fix(tracing): use header-only nvtx3
#590
aws-nslick
closed
2 weeks ago
0
fix(build): check features before mangling CFLAGS
#589
aws-nslick
closed
4 days ago
1
feat(build): add -Wextra to "picky" compiler flags
#588
aws-nslick
closed
5 days ago
0
fix(test): fix typing issues
#587
aws-nslick
closed
5 days ago
0
fix(rdma): avoid enum/integral comparison
#586
aws-nslick
closed
5 days ago
0
fix(tree): add fallthrough switch markers
#585
aws-nslick
closed
5 days ago
1
register_mr_buffers:544 NCCL WARN NET/OFI Unable to register memory (type = 2) for device 0. RC: -22, Error: Invalid argument
#584
visatish
opened
3 weeks ago
8
fix(tuner): don't choose NVLSTree if nRanks==nNodes
#583
AmedeoSapio
closed
3 weeks ago
1
chore(.github/workflows): constrain push triggers to known branches
#582
aws-nslick
closed
2 weeks ago
2
fix(cuda): avoid loading stub
#581
aws-nslick
closed
1 week ago
4
.ci/aws: Stop Running ofi nccl functional tests until they are fixed
#580
a-szegel
closed
3 weeks ago
1
.ci/aws: Pin p4/p5 ami's to AMI's from 8/7/24
#579
a-szegel
closed
3 weeks ago
2
chore(build): replace `-Wc++-compat' with `-x c++'
#578
aws-nslick
closed
5 days ago
0
fix(neuron): remove const from ncclNetPlugin_v{4,5} syms
#577
aws-nslick
closed
5 days ago
0
fix(sendrecv): add missing nccl-headers include
#576
aws-nslick
closed
5 days ago
0
fix(tree): avoid sign comparison issues
#575
aws-nslick
closed
4 days ago
1
fix(rdma): use COMM_ID_MASK as invalid id
#574
aws-nslick
closed
5 days ago
2
fix(tuner): fix implicit conversions
#573
aws-nslick
closed
5 days ago
0
fix(idpool): avoid sign comparison issues
#572
aws-nslick
closed
2 weeks ago
1
fix(param): move some parameters to unsigned
#571
aws-nslick
closed
6 days ago
2
feat(param): add uint parameter macro
#570
aws-nslick
closed
5 days ago
1
fix(tuner): avoid gotos
#569
aws-nslick
closed
5 days ago
0
feat(test): parse as c++ source
#568
aws-nslick
closed
5 days ago
0
chore(build): mpi: set mpicxx, too.
#567
aws-nslick
closed
5 days ago
2
chore(build): add AC_PROG_CXX
#566
aws-nslick
closed
5 days ago
1
fix(tree): use decltype instead of typeof for cxx
#565
aws-nslick
closed
5 days ago
1
fix(api): avoid mid-function initiializers
#564
aws-nslick
closed
2 weeks ago
2
fix(tree): move declarations to top of function
#563
aws-nslick
closed
1 day ago
4
nit: msgbuff: avoid typedef hell
#562
aws-nslick
closed
4 weeks ago
1
nit: deque: avoid typedef hell
#561
aws-nslick
closed
4 weeks ago
1
fix(freelist): use uintptr_t for pointer arithmetic
#560
aws-nslick
closed
2 weeks ago
1
build: add [[maybe_unused]] on EXPORT macro
#559
aws-nslick
closed
3 weeks ago
0
fix(rdma): fi_{send,write}data: do arithmetic on uintptr
#558
aws-nslick
closed
3 weeks ago
1
fix(aws): align declaration and init order
#557
aws-nslick
closed
3 weeks ago
1
feat(tree): add static_assert shim macro
#556
aws-nslick
closed
3 weeks ago
4
fix(tree): add spaces around PRIu64
#555
aws-nslick
closed
3 weeks ago
2
fix(tree): use correct __cplusplus guards
#554
aws-nslick
closed
3 weeks ago
4
rdma: Eliminate unnecessary ctrl message waits in eager protocol
#553
rauteric
closed
3 weeks ago
10
.ci/aws: Unpin al2 p3dn ami
#552
a-szegel
closed
1 week ago
3
fix(rdma): endpont_per_comm: NULL ptr bug
#551
rauteric
closed
1 month ago
1
.ci/aws: Decrease NCCL_TEST iterations to 5
#550
a-szegel
closed
3 weeks ago
7
tuner: Enable tuner init msg on INFO logs
#549
arunkarthik-akkart
closed
4 weeks ago
7
tuner: Enable tuner init msg on INFO logs
#548
arunkarthik-akkart
closed
1 month ago
0
RDMA support for g6e nodes
#547
Abhishek8394
opened
1 month ago
0
ci: fix efa installer caching
#546
aws-nslick
closed
1 month ago
1
ci: cache efa installer
#545
aws-nslick
closed
1 month ago
1
Expose each libfabric NIC as one NIC device to the user in case of non-NVIDIA platforms
#544
maxtmann
closed
1 month ago
0
Separate endpoint for control messages
#543
rajachan
closed
3 weeks ago
10
add ci build of rpms
#542
aws-nslick
closed
2 weeks ago
6
Previous
Next