issues
search
pytorch
/
torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
https://pytorch.org/torchx
Other
303
stars
96
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Ensure duplicate arguments are only checked within their respective argument groups
#911
hstonec
closed
12 hours ago
2
Update torchx_out_of_sync_training.py
#910
andywag
closed
6 days ago
0
Create a file for torchX OSS out-of-sync training
#909
yikaiMeta
closed
1 week ago
2
Update pyfmt component on FBS:master
#908
tpolasek
closed
1 week ago
4
fix: add utility capability for local_docker (#906)
#907
clumsy
closed
1 week ago
3
local_docker does not add utility nvidia libraries to containers
#906
clumsy
closed
1 week ago
2
feat: add aws_p5_48xlarge
#905
clumsy
closed
1 day ago
3
Use component full name in error message
#904
hstonec
closed
1 week ago
1
create minicube integration test
#903
yikaiMeta
closed
12 hours ago
0
Revert "Update api.py (#899)"
#902
kiukchung
closed
2 weeks ago
2
Create minicube integration test
#901
yikaiMeta
closed
1 week ago
0
Update README.md
#900
KPCOFGS
opened
2 weeks ago
0
Update api.py
#899
yikaiMeta
closed
2 weeks ago
0
feat: add privileged option to local_docker (#897)
#898
clumsy
opened
2 weeks ago
3
privileged flag for local_docker scheduler
#897
clumsy
opened
2 weeks ago
0
Update pyfmt component on FBS:master
#896
amyreese
closed
3 weeks ago
0
DO NOT APPROVE. Update gcp-batch-integration-tests.yaml
#895
yikaiMeta
opened
3 weeks ago
0
Fixed cmd_run issue with boolean inputs ( S412679)
#894
andywag
closed
3 weeks ago
4
ReDos Vulnerability on Torchx
#893
aydinnyunus
opened
3 weeks ago
2
Migrate components-integration-tests to Terraform
#892
yikaiMeta
closed
3 weeks ago
3
add workspace to TorchXEvent for logging
#891
ishachirimar
closed
3 weeks ago
3
add workspace to TorchXEvent for logging
#890
ishachirimar
opened
3 weeks ago
1
Fix issue with torchx status failing when return error is string instead of json
#889
andywag
closed
3 weeks ago
1
Fix for issue with multiple -- in command line
#888
andywag
closed
4 weeks ago
5
Added error with repeated component arguments
#887
andywag
closed
4 weeks ago
3
Update aws-batch-integration-tests.yaml
#886
yikaiMeta
closed
3 weeks ago
0
log start time to TSM logger scuba table
#885
ishachirimar
closed
4 weeks ago
6
fFx torchx runner schedule mock in torchx runner api test
#884
ishachirimar
closed
1 month ago
1
Refactor PopenHandler API for better subclassing for replica popen handling
#883
cniii
opened
1 month ago
3
add start epoch time to TorchXEvent
#882
ishachirimar
closed
1 month ago
1
Remove torchx component args env var (#880)
#881
ishachirimar
closed
1 month ago
1
Remove TORCHX_COMPONENT_ARGS env var from role
#880
ishachirimar
closed
1 month ago
3
Zip Slip Vulnerability on Torchx Examples
#879
aydinnyunus
opened
1 month ago
0
no fail on codecov errors
#878
d4l3k
closed
1 month ago
0
[test only DO NOT APPROVE] Update components-integration-tests.yaml
#877
yikaiMeta
closed
3 weeks ago
0
Fix too long locker_docker scheduler hostname
#876
asuta274
closed
1 month ago
0
Fix Nightly push permissions
#875
ryxli
closed
1 month ago
6
feat: add verbose flag to docker mixin
#874
clumsy
closed
1 month ago
12
Adding health check settings to Role spec
#873
gag1jain
closed
1 month ago
7
Fix torchx nightly regression
#872
andywag
closed
1 month ago
3
remove log_dir from LaunchConfig
#871
yikaiMeta
closed
1 month ago
3
Make torchx scheduler opts support enums
#870
andywag
opened
1 month ago
9
Update pyfmt component on FBS:master
#869
yikaiMeta
closed
1 month ago
1
Fixed Multiple Args Testing Issue
#868
andywag
closed
1 month ago
4
fix nightly.yaml
#867
yikaiMeta
closed
1 month ago
3
new requirements file for formatter versions
#866
amyreese
closed
1 month ago
2
Support home directory sign (~) in BindMount path
#865
asuta274
closed
1 month ago
0
fix google batch logs
#864
d4l3k
closed
1 month ago
0
[ci-fix] Fix failing doc-build due to sagemaker scheduler from https:…
#863
kiukchung
closed
1 month ago
0
Update black version in dev-requirements.txt
#862
yikaiMeta
closed
1 month ago
0
Next