issues
search
microsoft
/
mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
MIT License
233
stars
30
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Add proxy channel related operations
#351
Binyang2014
opened
1 day ago
0
Is there exist some documentation to explain the difference between allreduce algorithm in mscclpp?
#350
MARD1NO
closed
2 days ago
4
[Bug] libmscclpp_nccl fails linking using ROCm 6.0
#349
corey-derochie-amd
opened
3 days ago
0
[Doc] mscclpp docs
#348
Binyang2014
opened
1 week ago
0
Fix for ROCm 6.0
#347
chhwang
closed
1 week ago
0
Add CI for rocm
#346
Binyang2014
opened
1 week ago
0
tune threads per block for mscclpp executor
#345
Binyang2014
opened
2 weeks ago
0
Support executors to send packets over ProxyChannel
#344
caiomcbr
closed
1 week ago
0
Fix for ROCm 6.0
#343
chhwang
closed
1 week ago
0
ProxyChannel Support in Executor
#342
caiomcbr
closed
2 weeks ago
0
Fix bug for construct sempaphore
#341
Binyang2014
closed
1 week ago
0
Make ibverbs optional at compile time
#340
chhwang
closed
3 weeks ago
0
Removing Ibverbs Dependency
#339
caiomcbr
closed
3 weeks ago
0
Auto-tune vector sizes for NVLS allreduce6
#338
roshandathathri
closed
4 weeks ago
0
Dynamically load libibverbs
#337
caiomcbr
closed
1 month ago
0
bfloat16 support
#336
chhwang
closed
1 month ago
0
[Bug] `mscclpp/concurrency_device.hpp: No such file or directory`
#335
TZHelloWorld
closed
1 month ago
2
Fix missing import in executor test
#334
yzygitzh
closed
1 month ago
0
Update quickstart.md
#333
chhwang
closed
1 month ago
0
Add support for different vector sizes in multimem instructions
#332
roshandathathri
closed
1 month ago
0
NCCL API Executor Integration
#331
caiomcbr
closed
1 month ago
0
NCCL API Executor Integration
#330
caiomcbr
closed
1 month ago
0
Executor integration for NCCL APIs
#329
caiomcbr
closed
1 month ago
0
v0.5.2
#328
chhwang
closed
1 month ago
0
Support to write packets via uint2
#327
Binyang2014
closed
1 month ago
0
Multi-stream CUDA IPC
#326
chhwang
opened
2 months ago
0
Resolve clang++ warnings
#325
chhwang
closed
2 months ago
0
Double buffering for NCCL APIs
#324
caiomcbr
closed
1 month ago
0
[Bug] (Ib failure: Cannot allocate memory) reported when run mscclpp-test/allreduce_test_perf with MPI on 2 nodes
#323
dong-liuliu
closed
1 month ago
4
AllReduce Kernel for Small Messages
#322
caiomcbr
closed
2 months ago
0
Separate NPKit CPU timestamp access from different blocks for AMD platform
#321
yzygitzh
closed
2 months ago
0
[Bug]mscclpp::DeviceSyncer::sync Assertion failed
#320
linstreamer
closed
1 month ago
1
Support NCCL APIs
#319
caiomcbr
closed
2 months ago
0
Update allreduce_bench.py
#318
angelica-moreira
closed
2 months ago
0
Simplify/improve barrier in AllReduce6
#317
roshandathathri
closed
2 months ago
0
Add support for multicast reduce insruction
#316
roshandathathri
closed
2 months ago
0
[Bug] Bugs in mp_unit_test.
#315
TonyWu199
closed
1 month ago
3
Update quickstart.md
#314
angelica-moreira
closed
2 months ago
0
Add "packet type" option for executor test
#313
Binyang2014
closed
3 months ago
0
Fix NPKit support for AMD
#312
yzygitzh
closed
3 months ago
0
How to use mscclpp as a backend in pytorch
#311
wangfakang
closed
2 months ago
4
Add NPKit GPU event support
#310
yzygitzh
closed
3 months ago
0
Cumulative Updates
#309
Binyang2014
closed
3 months ago
0
v0.5.1
#308
chhwang
closed
3 months ago
0
A bug fix
#307
chhwang
closed
3 months ago
0
[EXPERIMENTAL] Enable NPKit GPU events
#306
yzygitzh
closed
3 months ago
0
Fix security issue
#305
Binyang2014
closed
3 months ago
0
Add C++ executor test
#304
chhwang
closed
3 months ago
0
Fix assert declaration & add a compile test
#303
chhwang
closed
3 months ago
0
[Bug] __assert_fail declaration in mscclpp breaks "assert()" usage in host functions.
#302
Alkaid-Benetnash
closed
3 months ago
1
Next