Add PACE_DACE_DEBUG to trigger full debug of DaCe orchestrated (dropping debug pass in SDFG + CUDA syncs after each kernel)
Add trivial debug pass to track progress for integrated mode where stacktrace can be swallowed
Code changes:
Fix DaCe debug pass
Requirements changes:
N/A
Checklist
Before submitting this PR, please make sure:
[x] You have followed the coding standards guidelines established at Code Review Checklist.
[x] Docstrings and type hints are added to new and updated routines, as appropriate
[x] All relevant documentation has been updated or added (e.g. README, CONTRIBUTING docs)
[ ] Unit tests are added or updated for non-stencil code changes
Purpose
Add
PACE_DACE_DEBUG
to trigger full debug of DaCe orchestrated (dropping debug pass in SDFG + CUDA syncs after each kernel) Add trivial debug pass to track progress for integrated mode where stacktrace can be swallowedCode changes:
Requirements changes:
N/A
Checklist
Before submitting this PR, please make sure: