issues
search
awslabs
/
sagemaker-debugger
Amazon SageMaker Debugger provides functionality to save tensors during training of machine learning jobs and analyze those tensors
Apache License 2.0
161
stars
83
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
bugfix for timelinewriter
#460
NRauschmayr
closed
3 years ago
1
Adding support for pytorch 1.8
#459
leleamol
closed
3 years ago
1
Revert accidental commits to the master repo.
#458
leleamol
closed
3 years ago
0
Pre commit build
#457
NihalHarish
closed
3 years ago
1
[bugfix] Do not wrap models when the hook has a default_config
#456
NihalHarish
closed
3 years ago
1
Supporting PT 1.8
#455
leleamol
closed
3 years ago
3
bugfix for timelinewriter
#454
NRauschmayr
closed
3 years ago
0
Understanding of how sagemaker-debugger works
#453
anotinelg
opened
3 years ago
1
Bump version for next release to 1.0.6
#452
ndodda-amazon
closed
3 years ago
1
SMDDP should use size() and rank() for TF jobs
#451
ndodda-amazon
closed
3 years ago
2
version bump
#450
NihalHarish
closed
3 years ago
1
Filter Repeating Logs
#449
NihalHarish
closed
3 years ago
2
Fix bug for actions and improve design
#448
ndodda-amazon
closed
3 years ago
1
Enable SMDDP ZCC tests
#447
ndodda-amazon
opened
3 years ago
2
TF keras.py _wrap_tape_gradient breaks for arrays
#446
arewellborn
opened
3 years ago
0
Bump Smdebug Version
#445
NihalHarish
closed
3 years ago
2
Split inputs to save nested structures
#444
NihalHarish
closed
3 years ago
1
Remove Pre-commit from xgboost buildspec
#443
NihalHarish
closed
3 years ago
1
Install awscli in xgboost buildspec
#442
NihalHarish
closed
3 years ago
2
Pin Numpy Version For Vanilla Codebuild Container
#441
NihalHarish
closed
3 years ago
1
Revert "Split inputs to save nested structures (#427)"
#440
NihalHarish
closed
3 years ago
2
Test CI
#439
NihalHarish
closed
3 years ago
0
Version Bump
#438
NihalHarish
closed
3 years ago
1
force install smdebug in the xgboost container
#437
NihalHarish
closed
3 years ago
2
New Buildspec For TF 2.3.1
#436
NihalHarish
closed
3 years ago
2
Remove Spamming Warning Log
#435
NihalHarish
closed
3 years ago
3
Can we save tensors that match a regex pattern only for a particular collection
#434
NihalHarish
opened
3 years ago
0
Fix: Incompatible Numpy Version on CI
#433
NihalHarish
closed
3 years ago
1
New Xgboost Buildspec
#432
NihalHarish
closed
3 years ago
1
Use codebuild env variable to get current branch in profiler integration tests
#431
ndodda-amazon
closed
3 years ago
1
Refactor profiler config parser tests
#430
ndodda-amazon
closed
3 years ago
1
Add script for checking files changed in a PR
#429
ndodda-amazon
closed
3 years ago
5
Merge pull request #1 from awslabs/master
#428
sophiayue1116
closed
3 years ago
2
Split inputs to save nested structures
#427
NihalHarish
closed
3 years ago
1
Compatibility with gradient accumulation
#426
quasimik
opened
3 years ago
1
Redo of PR 411 Use Smp rank and size when applicable
#425
rahul003
closed
3 years ago
1
Revert "Use SMP rank and size when applicable"
#424
ndodda-amazon
closed
3 years ago
1
Test Updates For TF 2.4
#423
NihalHarish
closed
3 years ago
1
Modify distributed_training_utils.py import for TF 2.4
#422
NihalHarish
closed
3 years ago
1
Cache TF Versions
#421
NihalHarish
closed
3 years ago
0
Profiler tf native training
#420
sophiayue1116
opened
3 years ago
1
TypeError: os.environ.get() takes no keyword argument (breaking all PyTorch training jobs)
#419
robwhelan
opened
3 years ago
0
TypeError: get() takes no keyword arguments - breaks training jobs
#418
robwhelan
opened
3 years ago
5
Fixing the codebuild project for xgboost.
#417
leleamol
closed
3 years ago
1
Fix flaky timeline writer test
#416
ndodda-amazon
closed
3 years ago
1
Fix mixed precision import bug for TF 2.4.x
#415
ndodda-amazon
closed
3 years ago
1
Upgrade pip dependency
#414
NihalHarish
closed
3 years ago
1
Sagemaker debugger hooks for keras unet
#413
shubham-scisar
opened
3 years ago
0
Updating the version number to 1.0.2
#412
leleamol
closed
3 years ago
1
Use SMP rank and size when applicable
#411
rahul003
closed
3 years ago
2
Previous
Next