issues
search
microsoft
/
mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
MIT License
246
stars
38
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
v0.5.2
#328
chhwang
closed
3 months ago
0
Support to write packets via uint2
#327
Binyang2014
closed
3 months ago
0
Multi-stream CUDA IPC
#326
chhwang
opened
3 months ago
0
Resolve clang++ warnings
#325
chhwang
closed
3 months ago
0
Double buffering for NCCL APIs
#324
caiomcbr
closed
3 months ago
0
[Bug] (Ib failure: Cannot allocate memory) reported when run mscclpp-test/allreduce_test_perf with MPI on 2 nodes
#323
dong-liuliu
closed
2 months ago
4
AllReduce Kernel for Small Messages
#322
caiomcbr
closed
4 months ago
0
Separate NPKit CPU timestamp access from different blocks for AMD platform
#321
yzygitzh
closed
4 months ago
0
[Bug]mscclpp::DeviceSyncer::sync Assertion failed
#320
linstreamer
closed
2 months ago
1
Support NCCL APIs
#319
caiomcbr
closed
4 months ago
0
Update allreduce_bench.py
#318
angelica-moreira
closed
4 months ago
0
Simplify/improve barrier in AllReduce6
#317
roshandathathri
closed
4 months ago
0
Add support for multicast reduce insruction
#316
roshandathathri
closed
4 months ago
0
[Bug] Bugs in mp_unit_test.
#315
TonyWu199
closed
3 months ago
3
Update quickstart.md
#314
angelica-moreira
closed
4 months ago
0
Add "packet type" option for executor test
#313
Binyang2014
closed
4 months ago
0
Fix NPKit support for AMD
#312
yzygitzh
closed
4 months ago
0
How to use mscclpp as a backend in pytorch
#311
wangfakang
closed
4 months ago
4
Add NPKit GPU event support
#310
yzygitzh
closed
4 months ago
0
Cumulative Updates
#309
Binyang2014
closed
4 months ago
0
v0.5.1
#308
chhwang
closed
5 months ago
0
A bug fix
#307
chhwang
closed
5 months ago
0
[EXPERIMENTAL] Enable NPKit GPU events
#306
yzygitzh
closed
5 months ago
0
Fix security issue
#305
Binyang2014
closed
5 months ago
0
Add C++ executor test
#304
chhwang
closed
5 months ago
0
Fix assert declaration & add a compile test
#303
chhwang
closed
5 months ago
0
[Bug] __assert_fail declaration in mscclpp breaks "assert()" usage in host functions.
#302
Alkaid-Benetnash
closed
5 months ago
1
Rename executor.cpp to executor_py.cpp
#301
chhwang
closed
5 months ago
0
Upgrade gtest
#300
chhwang
closed
6 months ago
0
allowing separate logs file per rank
#299
caiomcbr
closed
2 months ago
0
v0.5.0
#298
chhwang
closed
6 months ago
0
Allow obtaining cuda stream handle from PyTorch stream when launching kernel
#297
aashaka
closed
6 months ago
0
Move pipeline to Azure org
#296
Binyang2014
closed
6 months ago
0
Resolve multi-nodes test failure issue
#295
Binyang2014
closed
6 months ago
0
Optimized the execution kernel
#294
Binyang2014
closed
6 months ago
0
Refactoring NVLS interfaces
#293
chhwang
closed
6 months ago
0
Include GPU data types only for kernel code
#292
chhwang
closed
6 months ago
0
Seperate headers for GPU data types
#291
chhwang
closed
6 months ago
0
Allow binding allocated memory to NVLS multicast pointer
#290
roshandathathri
closed
6 months ago
0
[Feature] `deviceHandle()` interface is counter-intuitive
#289
chhwang
opened
6 months ago
0
Using MSCCL++ smchannel/proxy_channel within Triton Kernels
#288
rajagond
closed
6 months ago
1
[Feature] Usage as backend in Pytorch
#287
azharlightelligence
closed
6 months ago
1
Fix a typo name
#286
chhwang
closed
6 months ago
0
[Bug] Program hangs at proxy channel `wait()`
#285
liangyuRain
closed
4 months ago
1
Ethernet support
#284
chhwang
closed
6 months ago
5
Add executor to execute schedule-plan file
#283
Binyang2014
closed
6 months ago
0
[Bug]mscclpp-tests dont exit after test.
#282
TonyWu199
closed
4 months ago
3
MSCCL++ v0.5.0 Release Plan
#281
chhwang
closed
6 months ago
1
Add design documentation
#280
chhwang
closed
2 months ago
1
v0.4.3
#279
chhwang
closed
7 months ago
0
Previous
Next