issues
search
awslabs
/
sagemaker-debugger
Amazon SageMaker Debugger provides functionality to save tensors during training of machine learning jobs and analyze those tensors
Apache License 2.0
161
stars
83
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update the Hook callback to be compatible with xgboost>1.3.0 callback style
#616
haixiw
closed
1 year ago
2
xgboost error exception replacement
#615
yl-to
closed
2 years ago
0
Bumping version to 1.0.20 for TF 2.10 compatibility
#614
dkey-amazon
closed
2 years ago
0
Refactored MXNet/PyTorch/TF Exceptions, and End-of-training log directory check
#613
jleeleee
closed
2 years ago
0
add licensing information
#612
atqy
closed
2 years ago
0
Core logging
#611
jleeleee
closed
2 years ago
0
Add unified RTD search to RTD website
#610
atqy
closed
2 years ago
0
Cannot run a custom container using smdistributed/dataparallel unless USE_SMDEBUG is turned off
#609
plamb-viso
opened
2 years ago
0
fix horovod.torch import error
#608
ztlevi
opened
2 years ago
0
Bumped version to 1.0.19
#607
johnbensnyder
closed
2 years ago
0
disabled zcc test for pt 1.12+
#606
johnbensnyder
closed
2 years ago
0
fix: Remove Python 3.8 identity test warning
#605
ntw-au
closed
12 months ago
1
add a check to determine if horovod.torch import succeeds
#604
zaoliu-aws
opened
2 years ago
0
Added public key retrieval, needed on older GPU instances
#603
mariumof
closed
2 years ago
1
Unconstrained pip version in requirements (valid in config/profiler only)
#602
mariumof
closed
2 years ago
0
Setup
#601
mariumof
closed
2 years ago
0
Upped version AND supported PT version
#600
mariumof
closed
2 years ago
0
make the processing function more robust to handle corrupted json lin…
#597
zaoliu-aws
closed
2 years ago
3
Updates S3 wheel locations for install
#596
MZSHAN
closed
2 years ago
0
Adds script to delete old nightly wheels
#595
MZSHAN
closed
2 years ago
0
jinja2 import issue fix
#594
mariumof
closed
2 years ago
0
Fix config
#593
mariumof
closed
2 years ago
1
add more descriptive message when json reading error happens
#592
zaoliu-aws
closed
2 years ago
2
Increments supported tf version to 2.9
#591
MZSHAN
closed
2 years ago
1
try smth
#590
adimux
closed
2 years ago
0
BugFix: Python Profiler
#589
MZSHAN
closed
2 years ago
1
Do not remove work dir mid test
#584
mariumof
closed
2 years ago
1
Increase hook load times
#583
mariumof
closed
2 years ago
0
Bumped up hook times for PT
#582
mariumof
closed
2 years ago
1
Fix tests
#581
mariumof
closed
2 years ago
1
test_pytorch_integration.py::test_pytorch[False-False] is incompatible with PyTorch >=1.7
#580
tejaschumbalkar
opened
2 years ago
0
Remove/adjust some pytorch tests owing to framework version change
#579
mariumof
closed
2 years ago
1
Config
#578
mariumof
closed
2 years ago
1
Add simplejson to vanilla
#577
mariumof
closed
2 years ago
1
package bump
#576
mariumof
closed
2 years ago
0
Packages
#575
mariumof
closed
2 years ago
1
Added a config for back version, changed some comments and updated so…
#574
mariumof
closed
2 years ago
0
Debug
#573
mariumof
closed
2 years ago
1
Add the option to remove stable_release env var, none of the other bu…
#572
mariumof
closed
2 years ago
1
Missing then
#571
mariumof
closed
2 years ago
1
typo
#570
mariumof
closed
2 years ago
1
Add empty build spec and fix vanill build
#569
mariumof
closed
2 years ago
1
Adjust
#568
mariumof
closed
2 years ago
0
Mark PT 1.11.0 as supported
#567
MZSHAN
closed
2 years ago
1
Test
#566
mariumof
closed
2 years ago
1
Fix rules
#565
mariumof
closed
2 years ago
1
Package version changes. Needed because aws_cli conflicts and newer t…
#564
mariumof
closed
2 years ago
0
Test timing
#563
mariumof
closed
2 years ago
1
Bumped version, to incorporate test fixes
#562
mariumof
closed
2 years ago
1
Dataset version
#561
mariumof
closed
2 years ago
0
Previous
Next