issues
search
cisco-open
/
pymultiworld
A framework for PyTorch to enable fault management for collective communication libraries (CCL) such as NCCL
Apache License 2.0
15
stars
4
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
misc: make m8d-send-recv script flexible
#42
myungjin
closed
2 months ago
0
feat: added all_gather example
#41
raresgaia123
closed
2 months ago
0
feat: all reduce docs
#40
raresgaia123
closed
2 months ago
0
chore(deps): bump step-security/harden-runner from 2.8.0 to 2.9.0
#39
dependabot[bot]
closed
2 months ago
0
fix: incorrect worldstatus list allocation
#38
myungjin
closed
2 months ago
0
refactor: updated examples folder structure
#37
raresgaia123
closed
2 months ago
1
feat: added all_reduce with multiple worlds
#36
raresgaia123
closed
2 months ago
0
nit: keyerror exception handling
#35
myungjin
closed
2 months ago
0
fix: DEFAULT_WORLD_NAME import error
#34
myungjin
closed
2 months ago
0
feat: added multiworld all_reduce example
#33
raresgaia123
closed
2 months ago
1
doc: update publication
#32
myungjin
closed
2 months ago
0
refactor: update collective operations' signatures
#31
myungjin
closed
2 months ago
0
misc: reformatting resnet example
#30
myungjin
closed
2 months ago
0
refactor: support for patch files
#29
raresgaia123
closed
3 months ago
0
fix: pytorch v2.2.1 patch
#28
myungjin
closed
3 months ago
0
fix: fixed stdout not being flushed
#27
raresgaia123
closed
3 months ago
0
refactor: support for patch files
#26
raresgaia123
closed
3 months ago
0
refactor: update asyncio example
#25
raresgaia123
closed
3 months ago
1
release: version bump-up to 0.0.4
#24
myungjin
closed
3 months ago
0
chore: clean up unnecessary code/files
#23
myungjin
closed
3 months ago
0
feat: support for more ccl operations
#22
myungjin
closed
3 months ago
0
feat: concurrent world initialization
#21
myungjin
closed
3 months ago
0
misc: Apache 2.0 license update
#20
myungjin
closed
4 months ago
0
chore: clean up obsolete patch file
#19
myungjin
closed
4 months ago
0
patch: pytorch patch revision for multiworld
#18
myungjin
closed
4 months ago
0
misc: v0.0.2
#17
myungjin
closed
4 months ago
0
patch: pytorch v2.2.1 patch file for multiworld
#16
myungjin
closed
4 months ago
0
chore(deps): bump step-security/harden-runner from 2.8.0 to 2.8.1
#15
dependabot[bot]
closed
2 months ago
1
chore(deps): bump the github group with 2 updates
#14
dependabot[bot]
closed
2 months ago
0
misc: pypi workflow update
#13
myungjin
closed
4 months ago
0
chore: clean up unnecessary text
#12
myungjin
closed
4 months ago
0
misc+doc: patch file for pytorch's nccl support
#11
myungjin
closed
4 months ago
0
Added pypi_release file.
#10
raresgaia123
closed
4 months ago
0
Updated docs with prerequisites.
#9
raresgaia123
closed
4 months ago
0
doc: revise readme
#8
myungjin
closed
4 months ago
0
doc: readme update
#7
myungjin
closed
4 months ago
0
misc: explicit exit
#6
myungjin
closed
4 months ago
0
temp fix: disable destory_process_group
#5
myungjin
closed
4 months ago
0
fix: markdown lint errors
#4
myungjin
closed
4 months ago
0
chore: contributors in README.md
#3
myungjin
closed
4 months ago
0
chore(deps): bump step-security/harden-runner from 2.7.1 to 2.8.0
#2
dependabot[bot]
closed
4 months ago
0
Add GitHub workflows
#1
svrnm
closed
4 months ago
0
Previous